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This  dissertation  is  principally  concerned  with  incompleteness 
issues  in  the  design  of  artificial  intelligence  languages.  Major 
sources  of  incompleteness  are  the  pattern  matching  and  inference 
facilities  of  the  languages.  Incompleteness  in  the  area  of 
pattern  matching  can  be  repaired  by  developing  unification 
algorithms  for  the  specialized  data  types  of  the  languages.  A 
completer  but  potentially  infinite  unification  process  is 
described  for  arbitrary  data  types  in  general  and  is  applied  to 
the  QA4/QLISP  vector,  bag,  and  class  data  types.  Finite, 
complete  unification  algorithms  are  also  described  for  the  bag 
and  class  data  types.  The  bag  unification  algorithm  is  extended 
to  the  case  of  unification  of  first  order  predicate  calculus 
terms  with  functions  which  are  both  associative  and  commutative. 
Incompleteness  in  the  area  of  the  inference  system  can  be 
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repaired  by  use  of  some  form  of  the  « inference  procedure  which 
is  a complete  extension  derived  from  model  elimination  of  the 
problem  reduction  method.  This  can  readily  be  accomplished  in 
present  or  new  artificial  intelligence  languages  by  attempting  to 
derive  all  goals  in  the  context  of  the  asserted  negations  of  all 
higher  goals.  The  problem  of  compatibility  of  the  w procedure 
and  use  of  models  is  addressed  though  not  finally  resolved.  ' 
Design  of  and  experimental  results  for  the  theorem  proving 
program  PSTP  are  presented.  The  inference  system  used  is  a 
variant  of  the  t procedure  and  a version  of  PSTP  was  used  to 
verify  the  effectiveness  of  the  special  unification  algorithms 
developed. 
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1.  Introduction  ' | 

! 

This  thesis  is  broadly  concerned  with  the  areas  of  mechanical  j 

theorem  proving  and  artificial  intelligence  languages.  i 

First,  we  will  present  the  design  of  and  some  experimental 
results  for  a particular  theorem  proving  program:  the 
Progranunable  Strategy  Theorem  Prover  (PSTP)  . This  program  uses 

i 

the  newly  developed  linear  HSSON  procedure  as  its  inference 
system,  and  provides  great  flexibility  in  the  user  specification 
of  search  strategy  for  either  interactive  or  fully  automatic  use. 

The  remainder  of  this  thesis  will  be  devoted  to  a discussion 
of  completeness  issues  in  the  design  of  artificial  intelligence 
languages.  We  will  discuss  several  separate  areas  of 
incompleteness  in  these  languages  and  remedies  of  the 
incompleteness,  drawing  upon  the  field  of  mechanical  theorem 
proving  for  our  solutions. 

These  two  parts  of  this  thesis  are  linked  in  that  the  design 
of  and  experience  with  PSTF  motivated  some  of  the  ideas  for 
remedying  incompleteness  in  artificial  intelligence  languages, 
and  ideas  developed  in  consideration  of  artificial  intelligence 
languages  have  been  tested  and  incorporated  into  PSTP. 

1.1.  Artificial  intelligence  languages 

We  mean  by  the  term  artificial  intelligence  languages 
languages  such  as  PLANNER  [21] , CONNIVER  [33]  , and  0A4/0LISP 
[43,37,53,44].  These  are  high  level,  problem  solving  languages 
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designed  to  be  used  as  programming  languages  for  artificial 
intelligence  research.  A survey  of  these  and  other  languages  for 
artificial  intelligence  research  is  provided  by  Bobrow  and 
Raphael  ( 7] . 

These  languages  can  be  characterized  by  and  differentiated 
from  other  languages  by  possessing  certain  common 
characteristics. 

First  of  these  is  some  facility  for  pattern  matching.  Pattern 
matching  facilities  are  provided  for  associative  retrieval  of 
data,  for  data  item  decomposition,  and  for  data  item  composition. 
Patterns  and  data  items  are  constructed  of  such  things  as 
constants  (PLAKNER,  COKNIVER,  QA4/QLISP) , simple  variables 
(PLANNER,  CONNIVER,  QA4/0LISP) , fragment  variables  (PLANNER, 
QA4/0LISP,  partially  in  CONNIVER) , pattern  matching  actors  or 
constraints  (PLANNER,  QA4/QLISP) , lists  (PLANNER,  CONNIVER, 
0A4/QLISP) , and  bags  and  classes  (0A4/0LISP) . 

The  second  major  feature  of  artificial  intelligence  languages 
is  a capability  for  automated  dS^U£ti2Q«  There  is  a great  deal 
of  similarity  in  the  deduction  mechanisms  of  the  various 
artificial  intelligence  languages.  PLANNNER,  CONNIVER,  QA4/QLISP 
all  use  backward  chaining  via  the  GOAL  statement  and  CONSEOUENT 
theorems,  the  FETCH  statement  and  IF-NEEDED  methods,  and  the  GOAL 
statement  and  APPLY  TEAM  functions  respectively.  These 
essentially  identical  (except  for  terminology)  mechanisms  we 
consider  to  be  direct  use  of  the  problem  reduction  method 
explicated  by  Nilsson  [35] — reducing  (in  multiple  ways  to  deal 
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with  alternative  possible  solutions)  a goal  to  a set  of  subgoals, 
each  of  which  must  be  solved  in  order  to  solve  the  parent  goal. 


They  also  provide  facilities  for  updating  the  world  state 
maintained  by  the  system  when  data  items  are  added  to  or  deleted 
from  the  data  base.  Again,  these  mechanisms  are  essentially 
identical  (except  for  terminology)  in  PLANNER,  CONNIVER,  and 
QA4/0L1SP.  Each  language  provides  the  capability  for  performing 
arbitrary  actions  (but  usually  additions  to  or  deletions  from  the  | 

data  base,  representing  the  making  of  subsidiary  deductions  from  j 

an  altered  state)  when  there  are  additions  to  or  deletions  from 
the  data  base.  These  actions  are  performed  by  the  ANTECEDENT  or 

i 

ERASING  theorems  of  PLANNER,  by  the  IF-ADDED  or  IF-REMOVED 
methods  of  CONNIVER,  and  by  APPLY  TEAM  functions  of  CLISP  (demons 
in  QA4) . 

The  third  principal  feature  characterizing  artificial 
intelligence  languages  is  sgecial  control  structures,  and  the 
data  structure  required  to  support  them.  I 

The  first  of  this  generation  of  artificial  intelligence 
languages,  PLANNER,  "introduced”  backtracking  as  a mechanism  for 
handling  alternatives  in  the  search  for  a solution  in  the  problem 
reduction  space.  A limited  parallelism  was  also  allowed. 

Reacting  against  the  inefficiency  of  backtracking,  CONNIVER 
provided  a number  of  search  control  primitives,  and  made  these 
accessible  to  the  user. 

QA4/QLISP  also  implemented  backtracking,  and  provided 
coroutines  to  obtain  some  parallelism  and  non-backtracking 
alternation  of  search  domain. 
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1.2.  Completeness 


What  we  iTiean  by  completeness  of  artificial  intelligence 
languages  requires  some  explanation. 

In  a very  general  sense,  all  these  languages  are  complete,  as 
are  most  other  computer  languages.  There  exist  several 
definitions  of  computable  functions:  recursive  function  theory, 

Turing  machines,  Markov  normal  systens.  They  all  turn  out  to 
define  the  same  set  of  functions,  and  are  believed  (Turing's 
thesis.  Church's  thesis)  to  define  the  class  of  all  computable 
functions.  All  the  artificial  intelligence  languages  are 
theoretically  capable  of  computing  these  functions  (they  can  all, 
for  example,  be  used  to  program  a Turing  machine  simulator). 

However,  this  is  a very  weak  definition  of  language 
completeness  which  almost  entirely  fails  to  distinguish  between  | 

any  computer  languages:  the  weakest  assembly  language  and  | 

! 

PLANNER,  and  every  language  in  between,  are  all  complete  in  this  ] 

sense.  > 

I 

The  definition  of  completeness  as  applied  to  mechanical  | 

! 

theorem  proving  procedures  is  that  a procedure  is  complete  if  and  | 

only  if  (given  sufficient  resources  (time  and  space) ) it  is  | 

capable  of  finding  a proof  for  any  properly  posed  theorem.  Since  | 

a major  part  of  what  characterizes  artificial  intelligence 
languages  is  a facility  for  deduction,  this  definition  is 
applicable  to  evaluating  the  completeness  of  artificial 
intelligence  languages. 
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The  connection  between  completeness  of  theorem  proving 
procedures  and  artificial  intelligence  language  completeness  is 
most  direct  in  the  area  of  this  deduction  mechanism. 

The  principal  difficulty  of  completeness  with  this  deduction 
mechanism  is  not  with  the  state  updating  operations,  but  with  the 
problem  reduction  method  itself.  The  problem  reduction  method  is 
an  adequate  and  complete  deduction  mechanism  provided  the  logical 
negation  of  a goal  has  no  meaning.  However,  as  soon  as  negations 
of  goals  are  permitted,  the  problem  reduction  method  is  incapable 
of  performing  sorae  valid  deductions. 

We  will  discuss  an  extension  of  the  problem  reduction  method 
which  is  logically  complete  for  the  first  order  predicate 
calculus.  This  method  is  essentially  an  abstraction  of  the 
linear  MESON  procedure  used  in  PSTP. 

We  can  regard  a goal  as  representing  a theorem  to  be  proved; 
we  can  regard  PLANNER  CONSEQUENT  theorems  to  be  axioms;  the 
pattern  of  the  consequent  theorem  is  the  consequent  of  the  axiom, 
the  body  is  the  antecedent.  Our  question  concerning  completeness 
can  be  posed  as:  if  we  encode  axioms  as  CONSEQUENT  theorems,  and 
a theorem  to  be  proved  in  this  way,  will  PLANNER  always  discover 
a proof?  The  answer  is  no,  for  several  reasons. 

One  reason  is  that  the  PLANNER  deduction  mechanism  must,  in 
general,  have  access  to  contrapositives  of  axioms  as  well  as  the 
axioms  themselves.  Thus,  if  we  encode  A •»  B as  <CONSEQUENT  (] 

[B]  <GOAL  [A]>>,  we  must  also  present  <CONSEQUENT  (]  [NOT  [A]] 
<GOAL  [NOT  [Bl]>>.  This  we  consider  to  be  a responsibility  of 
the  user,  and  not  a completeness  deficiency  of  the  language. 
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The  second,  more  crnciAl  reason  is  the  logical  incompleteness 
(relative  to  the  first  order  predicate  calculus,  or  the 
propositional  calculus)  of  this  deduction  mechanism,  which  is 
merely  an  implementation  of  the  problem  reduction  method. 

The  point  is  that  the  user  can  reasonably  expect,  or  at  least 
hope,  that  the  deduction  mechanism  that  PLANNER  provides  is 
complete — that  it  will  solve  the  goal  if  it  is  logically  possible 
to  solve  it. 

The  incompleteness  of  the  deduction  mechanism  can  be 
"remedied"  in  various  ways.  One  way  is  to  just  ignore  the 
problem,  making  it  a user  responsibility  to  program  whatever 
additional  inference  mechanisms  he  needs  for  completeness.  This 
is  essentially  the  position  of  present  artificial  intelligence 
languages.  Another  solution  is  to  provide  within  the  language 
some  backup,  complete  deductive  facility.  This  is  actually  done 
in  PLANNER,  which  has  a resolution  theorem  proving  subsection.  A 
third  solution  is  to  alter  the  deduction  mechanism  so  that  its 
use  now  meets  the  user  expectation  of  completeness  directly, 
rather  than  using  a separate  facility  to  obtain  completeness. 

This  last  approach  is  the  one  being  explored  in  this  thesis. 

In  general,  our  approach  is  to  obtain  completeness  in  a form 
compatible  with  present  concepts  and  structures  of  artificial 
intelligence  languages,  and  without  significantly  adversely 
affecting  their  applicability  to  and  performance  on  present 
applications. 

The  second  area  of  incompleteness  is  in  pattern  matching. 
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since  pattern  matching  in  the  deduction  mechanism  plays  the  role 
of  unification  in  resolution,  it  is  apparent  that  for 
completeness,  pattern  matching  must  be  as  general  as  unification, 
i.e.,  it  must  be  able  to  represent  every  possible  match. 

Even  disregarding  the  functional  analogy  to  unification,  it  is 
a reasonable  expectation  that  every  possible  match  be  returned  by 
the  pattern  matching  algorithms. 

Pattern  matching  as  custcjiiarily  used  uses  an  expression  called 
the  pattern  expression  (pattern)  and  another  expression  called 
the  target  expression  (target).  In  its  customary  usage,  pattern 
matching  is  not  symmetric;  the  permitted  structure  differs  for 
patterns  and  targets.  In  general,  variables  and  pattern  matching 
actors  are  restricted  to  appearing  in  patterns;  they  may  not 
appear  in  targets.  Thus,  the  task  of  pattern  matching  consists 
of  finding  assignments  for  variables  and  processing  pattern 
matching  actors  such  that  the  resulting  instantiated  pattern  is 
identical  (or  equivalent))  to  the  target. 

We  will  show  that  for  some  purposes,  it  is  important  that 
pattern  matching  be  symmetric.  Specif icially,  we  will  not  want 
to  restrict  variables  and  actors  to  appearing  in  only  one  of  the 
expressions,  but  will  want  to  allow  both  expressions  to  be 
equally  general. 

So  long  as  pattern  matching  severely  limited  the  form  of  the 
target,  pattern  matching  is  a comparatively  simple  task,  and 
completeness  is  easily  maintained.  However,  as  the  specification 
of  targets  becomes  more  general,  the  pattern  matching  task 
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becomes  progressively  more  difficult,  ultimately  reaching  the 
stage  where  it  is,  in  general,  an  undecidable  problem  to 
determine  if  two  expressions  have  any  common  instances. 

We  will  consider  in  detail  the  pattern  matching  of  QA4/QLISP. 
CONNIVER  pattern  matching  is  either  already  complete,  or  could 
readily  be  made  so.  PLANNER  patterns  are  general  enough  to 
create  the  problem  of  potential  undecidability.  The  principal 
pattern  matching  facilities  of  QA4/QLISP  provide  an  intermediate 
level  of  theoretical  difficulty.  Furthermore,  they  (vectors, 
bags,  and  classes)  are  important  mathematical  objects,  and  the 
study  of  pattern  matching  for  them  has  significance  beyond  their 
immediate  application  to  QA4/QLISP. 

A third  area  of  incompleteness  in  the  control  structure  of 
programs  written  in  these  artificial  intelligence  languages.  A 
reliance  upon  depth  first  search  (with  backtracking  on  failure) 
is  necessarily  incomplete  whether  the  depth  is  unbounded 
(permitting  infinite  digressions  not  leading  to  a solution)  or 
bounded  (possibly  eliminating  all  solutions  due  to  extreme 
depth) . It  is  essential  for  completeness  that  there  exist  the 
capability  for  maintaining  simultaneously  multiple  potential 
solutions  of  arbitrary  size.  The  searching  procedures  of  PSTP 
(essentially  general  heuristic  search)  provide  one  solution  to 
this  problem. 

For  efficiency  reasons,  as  well  as  incompleteness  reasons, 
there  has  been  a strong  move  against  use  of  backtracking  in  later 
design  and  use  of  artificial  intelligence  languages.  Though 
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practical  m*attets  remain  (like  how  can  one  easily  specify  a 
search  procedure),  thn  theoretical  issues  are  well  understood  and 
widely  recognized,  so  we  will  have  little  to  say  about  this  area. 

One  contribution  artificial  intelligence  languages  have  made 
is  in  this  area  of  richer  control  structure.  This  thesis  is 
concerned  with  the  intersection  of  the  areas  of  mechanical 
theorem  proving  and  artificial  intelligence  languages. 

Mechanical  theorem  proving  provides  many  insights  into  how 
artificial  intelligence  languages  may  be  made  complete;  the 
overall  design  and  suggested  control  structures  of  artificial 
intelligence  languages  provide  new  ideas  for  structuring 
mechanical  theorem  proving  programs. 

A fourth  area  of  incompleteness  relates  to  our  solution  of  the 
incompleteness  problem  for  the  deductive  system  of  artificial 
intelligence  languages.  This  new  problem  concerns  the  use  of 
models.  Use  of  models  in  the  form  of  either  examples  or 
abstractions  of  the  actual  problem  is  highly  compatible  with  the 
problem  reduction  method  which  is  the  inference  system  of 
artificial  intelligence  languages.  Use  of  models  cannot  be 
arbitrarily  added  to  otherwise  complete  inference  systems  without 
possibly  losing  completeness. 

The  problem  reduction  method  remains  complete  (within  its 
domain  of  no  negated  goals)  with  a very  general  usage  of  models. 

There  is  substantially  more  difficulty  in  making  its  complete 
extension  compatible  with  models. 

We  will  present  some  examples  which  demonstrate  within  our 
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comolete  inference  system  the  necessity  foe  attemptinq  to  solve 
goals  which  are  known  to  be  unsolvable.  We  will  also  present 
modified  inference  procedures  which  are  syntactically  less 


restrictive  but  are  more  compatible  with  the  use  of  models. 

1.3.  Control  of  theorem  proving  programs 

Control  is  a critical  issue  in  the  design  and  use  of  present 
automatic  theorem  proving  programs.  Previous  theorem  proving 
systems  are  criticized  in  the  next  chapter  for  the  lack  of 
ability  to  control  the  expansion  of  the  search  space  except 
usually  by  setting  a few  global  parameters.  The  approach  of  PSTP 
is  one  proposed  solution  to  this  problem,  with  user  programmed 
search  strategies  and  filters  ordering  and  pruning  the  search 
space. 

Another  alternative  solution  to  the  problem  is  suggested  by 
the  system  of  Henschen  and  Wos  (191  which  provides  a theorem 
proving  language  to  manipulate  lists  of  clauses,  directing 
resolutions  be  performed  on,  for  example,  the  top  clauses  of  the 
first  and  second  lists.  This  allows  for  the  easy  programming  of 
most  standard  resolution  refinements. 

Both  systems  are  awkward  for  specifying  certain  kinds  of 
strategy  to  the  theorem  prover. 

PSTP's  search  order  capability  is  completely  general,  although 
it  may  be  difficult  in  any  particular  case  to  write  the 
appropriate  numerical  evaluation  function.  (Given  sufficient 
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numerical  range,  any  sequence  of  tents  may  be  represented  by  the 
sum  of  shifted  values  of  the  tent.)  It  is  furthermore  limited  to 
specification  of  search  strategy;  it  has  essentially  no 
capability  for  altering  the  inference  system  being  used.  Thus, 
it  is  very  difficult  to  change  PSTP  to  use  hyper-resolution 
rather  than  the  linear  MESON  procedure. 

Henschen's  system  is  also  limited  in  its  ability  to  specify 
inference  systems.  His  system  would  have  similar  difficulty  to 
PSTP  in  changing  from  resolution  to  the  linear  MESON  procedure 
since  his  system  only  understands  clauses,  not  chains. 

Some  kinds  of  strategy  information  are  difficult  to  provide  to 
henschen's  system  too.  The  basic  unit  manipulated  by  Henschen's 
system  is  the  list  of  clauses,  clauses  at  the  beginning  of  the 
list  being  resolved  upon,  and  newly  generated  clauses  being  added 
to  the  end  of  the  list.  Strategy  is  specified  primarily  by 
directing  which  clause  lists  are  to  be  used  presently.  There  are 
only  a small  number  of  clauses  lists  which  may  be  active  at  any 
given  time.  In  length  preference  strategies,  separate  lists 
would  be  maintained  for  clauses  of  length  1,  2,  ....  This  can 
effectively  be  done  in  cases  where  the  parameter  (like  length  in 
this  example)  takes  on  only  a small  range  of  values.  If  the 
parameter  has  a much  wider  range  of  values,  such  as  a size  or 
complexity  measure,  Henschen's  system  will  have  more  difficulty 
dealing  with  it  given  the  restricted  number  of  currently  active 
clause  lists. 

Thus,  one  of  the  objectives  of  this  thesis  is  to  do  the 
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preliminary  work  (i.e.,  investigating  issues  of  completeness  of 
artificial  intelligence  languages)  not  only  to  study  these 
languages  for  artificial  intelligence  problem  solving 
applications,  but  to  consider  application  of  these  type  of 
languages  to  the  theorem  proving  task.  The  use  of  an  artificial 
intelligence  language  with  the  required  logical  properties  (a 
standard  deduction  system,  and  completeness,  or  an  understanding 
of  where  completeness  is  lacking),  represents  an  interesting 
alternative  to  PSTP  and  Henschen's  theorem  proving  system  for 
specifying  strategy  in  attempting  a proof.  Although  present 
artificial  intelligence  languages  have  several  weaknesses  in 
their  logical  characteristics  which  this  thesis  tries  to  point 
out  and  remedy,  they  have  made  significant  contributions  in  the 
area  of  control,  and  it  is  these  contributions  which  we  hope  to 
see  used  in  future  theorem  proving  systems. 


1.4.  Inconsistency  and  Uncertainty 

The  practical  and  epistemological  suitability  of  formal  logic 
(and  specifically  the  first  order  predicate  calculus)  for 
performing  the  kinds  of  reasoning  demanded  of  artificial 
intelligence  programs  has  been  extensively  argued.  Some 
principal  objections  to  its  use  relate  to  formal  properties 
restricting  representation  of  inconsistency  and  uncertainty.  We 
attempt  to  answer  some  of  these  oojections  here  by  showing  that 
theorem  proving  programs  are  often  more  tolerant  of  irrelevant 
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inconsistency  than  is  often  believed,  and  that  it  is  possible  to  * 

develop  a logic  which  extends  both  fuzzy  logic  (for  representing 
uncertainty)  and  a four  valued  logic  (for  representing 
inconsistency.)  We  will  not  further  develop  the  properties  of 
this  logic  here,  but  this  logic  could  be  used  as  a basis  for 
answering  objections  to  more  orthodox  formal  theories. 

One  of  the  objections  to  the  use  of  mechanical  theorem  proving 
in  natural  reasoning  is  concerned  with  the  handling  of 

inconsistency  in  the  data  base.  1 

It  is  alleged  that,  following  the  definitions  of  the 
underlying  logical  system,  i.e.,  the  first  order  predicate 
calculus  in  which  any  inconsistency  results  in  the  truth  of 
everything,  the  presence  of  inconsistent  data  will  result  in  the 
generation  of  proofs  for  questions  unrelated  to  the  ' 

inconsistency. 

Even  restricting  our  attention  to  conventional  (first  order 
predicate  calculus)  logic,  this  allegation  is  false  in.  two 
respects. 

First,  the  existence  of  an  inconsistency  in  the  data  base  is 
no  guarantee  that  it  will  ever  be  discovered  in  reasonable  time. 

If  all  inconsistencies  were  so  readily  discovered  as  to  make  use 
of  data  bases  containing  inconsistencies  impossible,  then  the 
problem  of  creating  efficient  theorem  provers  would  be  solved  as 
they  themselves  generate  proofs  by  discovering  inconsistencies. 

Briefly  and  informally,  if  partial  deductions  are  generated  in  ,j 

increasing  order  of  their  coniplexity,  and  if  the  complexity  of  ■ 
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the  discovery  of  the  irrelevant  inconsistency  exceeds  the 
complexity  of  the  least  complex  proof  of  the  theorem,  then  the 
inconsistency  will  not  be  discovered  before  the  theorem  is 
proved.  Similarly,  if  (as  is  commonly  true  in  a theorem  prover 
based  question  answering  system)  the  question  is  not  a theorem, 
the  inconsistency  will  still  not  be  discovered  if  the  cost  of 
discovering  it  exceeds  the  resources  assigned  to  answer  the 
question. 

Second,  certain  mechanizations  of  first  order  predicate 
calculus  impose  their  own  relevance  criteria  on  items  in  the  data 
base  that  prevent  irrelevant  inconsistencies  from  entering  into 
an  attempted  proof.  One  of  the  most  powerful,  widely  used,  and 
intuitive  refinements  of  resolution  is  the  set  of  support 
strategy.  Briefly,  the  set  of  support  strategy  customarily 
divides  the  set  of  clauses  denoting  the  problem  into  two  parts: 
the  negated  theorem  (plus  possibly  special  hypotheses)  and  the 
axioms  (plus  possibly  special  hypotheses) . No  resolutions  are 
permitted  among  elements  of  the  second  set  (the  axioms;  every 
resolvent  must  have  as  a parent  a member  of  the  first  set  or  a 
descendant  of  a member  of  the  first  set.  Although  the  proof  of 
completeness  of  resolution  using  the  set  of  support  strategy 
prescribes  that  (for  completeness  to  be  maintained)  the  second 
set  must  be  consistent,  allowing  it  to  be  inconsistent  is  a 
sometimes  useful  generalization. 

If  there  is  no  way  for  the  theorem  prover  to  bring  the  clauses 
participating  in  the  irrelevant  inconsistency  into  the  search  for 
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a proof  of  the  theorem,  then  the  inconsistency  will  not  in  any 
way  affect  the  search. 


For  example,  consider  a data  base  consisting  solely  of  the  two 
assertions  A and  -»A.  According  to  the  definition  of  the  first 
order  predicate  calculus  and  the  resolution  theorem  proving 
procedure  without  any  refinements,  every  proposed  theorem  will 
have  a proof.  However,  if  the  set  of  support  strategy  is  used, 
only  A and  -lA  can  be  proved.  If  a proof  is  attempted  for  B,  it 
will  be  discovered  that  -*p.  (the  negation  of  the  theorem)  cannot 
be  resolved  with  any  other  clause.  The  resolution  of  A and  -A  is 
forbidden  by  the  set  of  support  strategy.  Thus,  no  proof  of  B 
will  be  found. 

Although  this  definition  of  relevance  (related  as  it  is  to  the 
failure  to  discover  inconsistencies  due  to  finding  no  set  of 
literals  linking  clauses  of  the  theorem  to  clauses  in  the 
inconsistency)  is  rather  syntactic  in  form,  it  appears  to  be 
compatible  with  conventional  notions  of  relevance.  If  a person 
held  the  inconsistent  beliefs  that  it  was  both  raining  and  not 
raining  at  some  fixed  point  and  time,  he  would  be  unlikely  to  let 
either  belief  interact  with  the  determination  of  some  other 
unrelated  question  such  as  the  location  of  a pencil.  The 
mechanical  theorem  proving  system  would  behave  similarly  since 
there  will  not  in  general  be  any  link  in  the  data  between  issues 
of  rain  and  pencil  locations. 

A mechanical  theorem  proving  system  using  the  set  of  support 
strategy  in  this  way  seems  to  deal  with  inconsistency  in  5 
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substantially  the  same  manner  as  a human  bcinq.  II  the 
inconsistency  is  a subtle  one,  the  human  will  simply  not  notice 
it,  and  it  is  not  likely  to  be  discovered  by  a theorem  prover 
since  the  cost  of  its  discovery  will  exceed  the  cost  ol  the 
cheapest  proof  or  the  resources  allocated  to  discovering  a 
nonexistent  proof.  Humans  can  successfully  deal  with  even 
blatant  inconsistencies  in  their  beliefs  by  recognizing  these 
beliefs  are  irrelevant  to  the  question  at  hand.  The  set  of 
support  strategy  similarly  prescribes  a relevance  criterion  which 
must  be  met  for  the  inconsistent  data  to  be  used. 

Although  unrelated  to  the  approach  of  this  thesis,  there  is 
yet  a third  way  in  which  formal  theorem  proving  systems  can  deal 
with  inconsistency.  This  way  is  to  extend  the  logic  being  used 
to  either  classify  facts  into  several  levels  representing  degree 
of  belief  in  them  or  to  directly  represent  inconsistency  in  the 
logic. 

The  first  of  these  possibilities  is  exemplified  by  Rescher 
[40] . Here,  beliefs  are  classified  into  modal  categories 
indicating  strength  of  belief.  If  a belief  system  is 
inconsistent,  it  can  be  made  consistent  by  stripping  away  beliefs 
by  modal  category  in  order  of  ascending  degree  of  belief. 

The  second  of  these  possibilities  is  exemplifed  by  Belnap 
[3,4].  Belnap's  model  is  a question  answering  system  which 
accepts  inputs  and  answers  queries.  In  contrast  to  Rescher 's 
system  in  which  one  must  effectively  assign  different  credibility 
to  different  beliefs  (placing  them  in  different  modal  ^ 
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categories),  Belnap  essentially  assumes  that  we  have  no  reason  to 
judge  the  credibility  o£  one  input  higher  than  that  of  another 
(possibly  contradictory)  input.  His  way  of  handling 
inconsistency  is  to  extend  the  conventional  two  valued  logic  to  a 
four  valued  logic.  The  four  values  are:  T (indicating  the  system 
has  been  told  the  proposition  is  true),  F (indicating  the  system 
has  been  told  the  proposition  is  false).  Both  (indicating  the 
system  has  been  told  the  proposition  is  true  and  the  proposition 
is  false) , and  None  (indicating  the  system  has  neither  been  told 
the  proposition  is  true  nor  the  proposition  is  false).  Belnap 
develops  truth  tables  computing  the  value  of  compound  expressions 
of  propositions  having  these  values  (they  are  quite  intuitive: 
for  example.  Both  a p is  F,  Both  v f is  Both,  -iBoth  is  Both)  and 
a mechanism  for  updating  the  state  of  the  question  answering 
system  as  new  assertions  are  made. 

Another  formalism  for  uealing  with  uncertainty  if  not  directly 
with  inconsistency  is  the  fuzzy  logic  of  Zadeh  [55]  . Here, 
propositions  are  assigned  truth  values  in  the  range  [(J,l]  rather 
than  {T,P}.  An  assignment  of  truth  value  1 is  essentially 
equivalent  to  an  assignment  of  truth  value  T in  conventional 
logic;  an  assignment  of  truth  value  0 is  essentially  equivalent 
to  an  assignment  of  truth  value  F in  conventional  logic.  In  this 
way,  fuzzy  logic  is  an  extension  of  the  conventional  two  valued 
logic.  Values  between  0 and  1 represent  further  degrees  of 
truth.  The  truth  value  of  -«A  is  1 minus  the  truth  value  of  A. 
Thus,  fuzzy  logic  is  noi.  capable  of  dealing  with  A and  -lA  both 
being  asserted. 
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However,  fuzzy  logic  can  itself  be  extended  to  considering  not 
just  numbers  in  the  range  (0,11,  but  pairs  of  numbers  in  the 
range  (0,1)  as  truth  values.  The  first  number  x in  the  truth 
value  <x,y>  denotes  the  degree  of  truth  or  degree  of  confidence 
in  the  truth  of  the  proposition  to  which  it  is  assigned;  the 
second  number  y denotes  the  degree  of  falsity  or  degree  of 
confidence  in  the  falsity  of  the  proposition. 

Truth  values  for  non-atomic  formulas  are  computed  by  the 
following  rules: 

-i<x,y>  a <y,x> 

<x 1 ,yi >v<xi ,yi>  = <max (X  I , xi ) ,min(y I ,y2 ) > 

<x  i,y  i>A<xi ,yi>  = <min (x 1 ,x j ) ,max (y I ,y2 ) > 

If  truth  values  are  restricted  to  <1,0>  and  <0,1>,  we  have  the 
conventional  two  valued  logic  in  which  <1,0>  denotes  T and  <0,1> 
denotes  F.  If  truth  values  are  restricted  to  <x,l-x>  x « (0,1], 
we  have  ordinary  fuzzy  logic.  If  truth  values  are  restricted  to 
<1,0>,  <0,1>,  <1,1>,  and  <0,0>,  we  have  the  four  valued  logic  of 
Belnap  in  which  <1,0>  denotes  T,  <0,1>  denotes  F,  <1,1>  denotes 
Both,  and  <0,U>  denotes  None.  If  truth  values  may  be  freely 
taken  from  <x,y>  x,y  e (0,1),  we  have  a new  logic  which  is  an 
extension  of  both  fuzzy  logic  and  Belnap's  four  valued  logic. 

This  last  new  logic  will  be  equally  capable  of  dealing  with 
inconsistency  as  Belnap's  four  valued  logic  yet  will  have  the 
additional  facility  of  representing,  for  example,  the  credibility 
of  information  providers  to  the  system  by  assigning  differing 
numerical  values  to  prepositions. 


An  important  point  about  this  generalization  of  fuzzy  logic 
and  Belnap's  four  valued  logic  is  that  the  theory  for  the  latter 
also  provides  a theory  for  the  new  logic.  The  operations  for 
computation  of  truth  values  and  computation  of  alteration  in  the 
data  base  when  assertions  are  made  are  based  in  theory  on  the 
truth  values  T,  F,  Both,  None  forming  approximation  and  logical 
lattices.  The  new  extended  set  of  truth  values  <x,y>  x,y  € 10,1] 
also  form  lattices  in  precisely  the  same  way,  so  that  operations 
and  their  justification  are  analogous  to  those  in  Belnap's  logic. 
These  results  will  be  presented  in  a separate  paper. 


2.  An  Inference  System  for  Artificial  Intelligence  Languages 
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2.1.  Introduction 

This  chapter  is  concerned  with  an  inference  system  for 
artificial  intelligence  languages.  Early  sections  of  the  chapter 
present  terminology  and  background  material  on  inference  systems 
related  to  the  one  we  propose  as  appropriate  for  artificial 
intelligence  languages.  In  particular,  Herbrand's  theorem  and 
resolution  and  model  elimination  procedures  are  described. 

The  MESON  and  linear  MESON  procedures  are  then  described. 

These  are  similar  to  model  elimination  except  for  their  more 
obvious  relationship  to  the  problem  reduction  method.  The 
Programmable  Strategy  Theorem  Prover  (PSTP)  theorem  proving 
program  using  the  linear  MESON  procedure  is  then  described  in 
substantial  detail  with  experimental  results. 

The  ^ theorem  proving  procedure  with  refinements  for  more 
efficient  handling  of  the  equality  relation  and  use  of  models  is 
then  described.  The  tr  procedure  is  a complete  theorem  proving 
procedure  which  seems  particularly  well  suited  for  use  in 
artificial  intelligence  languages.  It  is  essentially  an 
abstraction  of  the  MESON  and  linear  MESON  procedures  which 
ignores  representation  issues  and  merges  into  a single  problem 
reduction  like  operation  the  problem  reduction  operation  and  a 
form  of  reasoning  by  contradiction. 
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The  inference  system  which  is  uniformly  employed  by  artificial 
intelligence  languages  is  essentially  based  on  the  problem 
reduction  method.  This  however  is  an  incomplete  inference  system 
for  the  prepositional  or  first  order  predicate  calculus  since  its 
handling  of  the  negations  of  goals  is  incomplete,  allowing  for  no 
proof  by  contradiction  mechanism.  We  will  present  in  this 
chapter  a compatible  extension  of  the  presently  used  problem 
reduction  method  which  is  complete.  This  minimally  alters  the 
present  inference  system,  requiring  only  that  negated  goals  be 
asserted  in  the  context  in  which  their  proofs  are  being 
attempted.  This  can  easily  be  accomplished  even  in  present 
artificial  intelligence  lemguages  by  including  assertions  of 
negated  goals  at  the  beginning  of  the  code  of  consequent 
theorems. 

Especially  when,  as  here,  completeness  can  be  achieved  so 
easily,  we  regard  completeness  as  a desirable  property.  It 
assures  us  that  if  a proof  cannot  be  found,  it  is  not  the  fault 
of  the  inference  system.  Failure  to  find  a proof  must  be  a 
consequence  of  faulty  axiomati zation  or  incomplete  other 
components  of  the  language,  such  as  pattern  matching  or  search 
strategy.  This  is  important  since  incompleteness,  particularly 
incompleteness  in  the  inference  system,  is  usually  demonstrable 
even  with  very  simple  (and  thus  more  likely  to  occur)  examples. 
Certainly  the  examples  we  show  for  the  incompleteness  of  the 
problem  reduction  method,  and  for  the  first  proposed  model 
relative  ^ procedure  in  the  next  chapter  are  all  simple. 
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Besides  the  theoretical  desirability  of  completeness,  another 


i 
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reason  for  being  concerned  with  completeness  in  the  inference 
system  is  that  some  things  can  be  done  in  different  and  possibly 
superior  ways  using  a complete  system  rather  than  the  currently 
used  incomplete  systems. 

A good  example  of  this  is  the  generation  of  contingent  plans. 
In  QA4,  a general  schema  for  generating  contingent  plans  is  to 
assert  in  local  contexts  different  values  for  undetermined 
predicates,  construct  a plan  for  each  value,  and  synthesize  a 
contingent  plan  from  the  resulting  single  plans. 

Consider  the  QA4  example  of  trying  to  have  fun.  One  cein  have 
fun  at  the  beach  if  it  is  sunny,  or  can  have  fun  at  the  movie  if 
it  is  rainy  (it  is  assumed  it  is  either  sunny  or  rainy). 

This  can  be  axiomatized  as: 

Ex^ple  2.1 . 

1.  sunny  ♦ havef un ( at-beach) 

2.  rainy  ■»  havef  un  ( at-movie) 

3.  sunny  v rainy 

4.  havefun(s)  ♦ G 

5.  G 

Here,  we  have  introduced  the  global  goal  G rather  than  just 
using  goal  havefun(s)  as  the  top  goal.  This  is  done  for 
technical  reasons  so  that  the  top  goal  is  variable  free. 

The  QA4  approach  to  solving  this  problem  involves  generating 
and  combining  separate  plans  for  the  two  cases  concerning  whether 
it  is  rainy  or  sunny. 

Representing  these  separate  cases  in  MESON  procedure  proofs, 
we  have 

2.2. 
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1.  sunny  -*  havef  un  ( at-beach) 

2.  rainy  ♦ havef un { at-movie) 

3.1.  sunny 

4.  havefun(s)  G 

5.  G 

6.  IG]  hav.d£un(s) 

7.  IG]  [havef  un  ( at-beach)  1 sunny 

8.  0 


theorem 
extend  . 
extend 
ex  tend 


by  4 
by  1 
by  3.1 


and 

1.  sunny  ♦ havef un (at-beach) 

2.  rainy  ♦ havef  un  ( at-mov.ie) 

3.2.  rainy 

4.  havefun(s)  ♦ G 

5.  G 

6.  [G]  hav.efun(s) 

7.  (G)  [havef un ( at-movie) 1 rainy 

8.  0 


theorem 
extend  by  4 
extend  by  2 
extend  by  3. 2 


The  synthesized  plan  is  then  "if  it  is  sunny  tnen  go  to  the 
beach  else  go  to  the  movie"  (or,  equivaLently,  "if  it  is  rainy 
then  go  to  the  movie  else  go  to  the  beach"),. 

An  alternative  approach  possible  in  the  complete  MESON 
procedure  is  demonstrated  in  the  -following  proof: 


1.  sunny  ♦ havef un ( at-beach) 

2.  rainy  ♦ havef un ( at-movie) 

3.  sunny  v rainy 

4.  havefun(s)  ♦ G 

5.  G 

6.  [G]  hav.efun(s) 

7.  [G]  [havef un ( at-beach) ] sunny 

8.  [G]  [havef un ( at-beach) 1 [sunny] 
-•rainy 

9.  [G]  [havefun (at-beach) ] [sunny] 
-•havef  un  ( at-movie) 

10.  [G]  [havef un  ( at-.beach)  ] [sunny] 

[-•havef  un  ( at-movie)  ] -•G 

11.  0 


theorem 
extend  by  4 
extend  by  1 


extend  by  3a 
[-•rainy] 

.extend  by  contrapositive  of  2 
[-•rainy] 

extend  by  contrapositive  of  4 
red uce 


A synthesized  plan  can  now  be  generated  from  this  proof  in  a 
fashion  similar  to  the  extraction  of  programs  from  resolution 
proofs  developed  by  Walding.er  [51].  Although  there  are 
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undoubtedly  more  direct  approaches  to  generating  the  plan  from 
the  proof,  that  the  plan  can  be  derived  is  evident  since  the 
MESON  procedure  proof  has  an  equivalent  resolution  proof  to  which 
the  Waldinger  algorithm  may  be  applied. 


2.2.  Terminology 

Definition  A fifiio!  is  defined  to  be 

(1)  a variable,  or 

(2)  a function  symbol  followed  by  a list  of  terms.  The  term  is 
called  a constant  if  the  list  of  terms  following  the  function 
symbol  is  empty. 

Variables  will  usually  be  denoted  by  lower  case  letters  at  the 
end  of  the  alphabet:  e.g.,  u,v,w,x,y,z.  Constants  will  usually 
be  denoted  by  lower  case  letters  at  the  beginning  of  the 
alphabet:  e.g.,  a,b,c,d,e.  Function  symbols  of  higher  degree 
will  usually  be  denoted  by  lower  case  letters  occurring  after 
constants  in  the  alphabet:  e.g.,  f,g,h,j,k.  Exeunple  terms  are 
then  X,  y,  a,  b,  f(a,b),  g (f (x,h (y) ) ) . 

2,2.  An  atomic  iOimula  is  defined  to  be  a 
predicate  symbol  followed  by  a list  of  terms.  The  atomic  formula 
is  called  a Koposition  if  the  list  of  terms  following  the 
predicate  symbol  is  empty. 

Predicate  symbols  will  usually  be  denoted  by  upper  case 
letters:  e.g.,  P,  P,  Q,  R.  Example  atomic  formulas  are  then  P, 


24 


Q(a,b),  R{f (x,h (y) ) ) . The  parentheses  surrounding  the  list  of 
terms  following  the  predicate  symbol  and  commas  separating  terms 
are  often  omitted;  Qab,  Rf(xh(y)). 

PSliDi^i2D  2jl3..  a formula  is  defined  to  be 

(1)  an  atomic  formula, 

(2)  VxB  where  B is  a formula  and  x is  a variable, 

(3)  3xB  where  B is  a formula  and  x is  a variable, 

(4)  -iB  where  B is  a formula, 

(5)  (A  ^ B)  where  A and  B are  formulas, 

(6)  (A  ^ B)  where  A and  B are  formulas, 

(7)  (A  -►  B)  where  A and  B are  formulas,  or 

(8)  (A  B)  where  A and  B are  formulas. 

Formulas  without  the  parenthetical  structure  dictated  by  the 
above  definition  can  be  interpreted  according  to  the  following 
rules: 

(1)  the  operators  listed  in  decreasing  order  of  precedence  are 

Vx,  ^x , 1,  ♦,  and  thus,  for  example,  -A  ''  B ♦ C is 

equivalent  to  ( (-A  ''  B)  ♦ C)  , 

(2)  ^ , and  are  associative  and  thus  A ''  B ''  C is  equivalent 

to  (A  ''  (B  ''  O)  and  ( (A  ''  B)  ''  C)  , A ''  B C is  equivalent  to  (A 

(B  C))  and  ((A  ^ B)  ^ C)  , and  (A  B C)  is  equivalent  to  (A 

( B C ) ) and  ( ( A*-^  B ) ♦^  C ) , and 

(3)  ♦ associates  to  the  right  and  thus  A ♦ B ♦ C is  equivalent  to 
(A  - (B  ♦ O)  . 

Formulas  are  intended  to  be  evaluated  to  true  or  false.  The 
value  of  VxA  is  true  if  A is  true  for  every  value  of  x and  false 
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otherwise.  The  value  of  ^xA  is  true  if  A is  true  for  some  value 
of  X and  false  otherwise.  The  value  of  -lA  is  true  if  A is  false 
and  false  if  A is  true.  The  value  of  A v B is  true  if  A or  B is 
true  and  false  if  both  are  false.  The  value  of  A a b is  true  if 
A and  B are  true  and  false  if  either  is  false.  The  value  of  A 
B is  true  if  A is  false  or  B is  true  and  false  if  A is  true  and  B 
is  false.  The  value  of  A B is  true  if  A and  B are  true  or  A 
and  B are  false  and  false  if  they  are  neither  both  true  nor  both 
false . 

PSliDi£i2D  2^4.  A literal  is  defined  to  be  a formula  of  the 
form  A or  -lA  where  A is  an  atomic  formula. 

The  complement  ~C  of  the  literal  C is  A if  C 
is  of  the  form  -lA,  -lA  if  C is  of  the  form  A. 

P®^iDi£i2D  An  assertion  is  defined  to  be  a formula  of 

the  form  Aj^  ^ ^ . ''  (m^O,  n^l)  where  each  Aj^ 

and  each  Cj  is  a literal.  Each  Aj^  is  called  an  antecedent  of  the 
assertion  and  each  Cj  is  called  a conseguent  of  the  assertion. 

Definition  An  injplication  is  defined  to  be  a formula  of 

the  form  Aj^  ^ ...  Aj„  ♦ C {m>l)  where  each  Aj^  and  C is  a 
literal r i.e.,  an  implication  is  an  assertion  with  n-1. 

Definition  2.8.  A disjunction  is  defined  to  be  a formula  of 
the  form  '^  . . . (n^l)  where  each  Cj  is  a literal,  i.e.,  a 

disjunction  is  an  assertion  with  m^o.  it  is  also  called  a 

1 212222 • 
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P®liDi£i25  2^9.  A BESnji§§  is  defined  to  be  a formula  of  the 
form  C where  C is  a literal,  i.e.,  a premise  is  an  assertion  with 
n=l  and  m*0.  It  is  also  called  a unit  clause  since  it  is  a 
clause  with  a single  literal. 

PS|iDi^i2D  2a.lfl.  A gositive  literal  is  an  atomic  formula. 

P2liQi£i2D  2j.I1.  a DSflfltiXfi  ll£S£Sl  is  a negated  atomic 
formula. 

P®^iDi-i2!}  2^12.  A Horn  clause  is  a clause  containing  not 
more  than  one  positive  literal. 

Definition  2j.l3.  A Horn  set  is  a set  of  Horn  clauses. 

Definition  2.14.  A gositive  clause  is  a clause  containing 
only  positive  literals,  i.e.,  having  no  negative  literals. 

P2|iDiii2D  2^15.  A negative  clause  is  a clause  containing 
only  negative  literals,  i.e.,  having  no  positive  literals. 

Definition  2._16.  A mixed  clause  is  a clause  containing  both 
positive  and  negative  literals. 


2.3.  Problem  reduction  method 

The  problem  reduction  method  is  a widely  understood,  very 
general,  and  nearly  universally  applicable  problem  solving 
technique.  Informally  defined,  the  problem  reduction  method 
consists  of  finding  for  a goal  conjunctive  sets  of  subgoals  the 
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simultaneous  solution  of  one  of  these  constituting  a solution  of 
the  original  goal.  The  method  is  recursively  applied  to  each 
generated  subgoal  until  the  resultant  subgoals  are  all 
immediately  solvable,  or  some  subgoal  is  unsolvable  and  that 
particular  attempted  solution  is  abandoned. 

In  this  thesis,  we  are  concerned  with  the  solution  of  problems 
expressed  symbolically.  Accordingly,  we  will  give  a more  formal 
definition  of  the  problem  reduction  method  as  applied  to  symbolic 
expressions. 

Let  B^,...,Bp  be  a set  of  assertions  of  the  form  a ...  a Aj^ 
♦ C (m^O)  (implications  and  premises),  and  C be  a single  literal 
goal.  An  implication  B|^  of  the  form  A^  a . . . a Aj^  ♦ C represents 
the  statement  that  "to  solve  (prove)  C,  solve  (prove)  A^,  A2, 

...,  Ajjj" . A premise  of  the  form  C (i.e.,  m»Q ) represents  that  C 
is  immediately  solvable  (provable,  i.e.,  true). 

Following  is  a definition  of  the  problem  reduction  method  in 
terms  of  its  single  inference  rule. 

iiU-  Bj, . . . ,Bpt-pC  if 
B|^isAj^A...  aa^-»C  (m^O) 
and  B^, . . . , Bpt-pAj^ 

and  Bj^, . . . , Bpi-pAj^ 

and  Bi, . . . , Bpt-pAn,  for  some  k (l£k£p)  . 

Here,  "t-p"  represents  solvability  (provability)  by  the 
problem  reduction  method. 
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It  can  be  readily  seen  that  the  problem  reduction  method  is 
complete  in  those  instances  where  the  negation  of  a goal  has  no 


meaning.  In  other  words,  if  , Bp>-C  then  B^, . . . , Bpt-pC, 

i.e.,  C is  derivable  by  the  problem  reduction  method  from 
B2^,...,Bp.  This  completeness  where  negations  of  goals  are 
meaningless  is  a result  of  the  problem  reduction  method  being 
isomorphic  to  the  input  resolution  procedure,  a complete  theorem 
proving  procedure  for  Horn  formulas  which  are  a representation 
for  problems  with  meaningless  negations. 

However,  the  problem  reduction  method  is  incomplete  when 
applied  in  situations  where  the  negations  of  goals  ate  employed. 

A simple  example  is  the  attempt  to  solve,  by  the  problem 
reduction  method,  the  goal  A given  implications  B ♦ A and  -iB  A. 
It  is  apparent  that,  under  the  usual  interpretation  of  that  A 
is  a consequence  of  B A and  -iB  ♦ A.  Yet  the  problem  reduction 
method  has  insufficient  logical  capability  to  detect  this  fact. 

It  merely  establishes  separate,  disjunctive  subgoals  B and  -iB, 
recognizes  that  it  can  solve  neither,  and  fails. 

A less  serious  logical  difficulty  of  the  problem  reduction 
method  is  its  incompleteness  in  the  absence  of  contraposi tives  of 
implications.  From  B and  -A  ♦ -iB  we  can  deduce  A,  but  not  by  the 
problem  reduction  method.  The  contrapositive  B ♦ A of  -*A  ♦ -fi 
must  be  included  among  the  implications  to  generated  a problem 
reduction  method  solution.  This  difficulty  is  also  related  to 
the  negations  of  goals  since,  if  negations  of  goals  are 
meaningless,  the  contraposi tives  would  also  lack  meaning  since 
they  contain  negated  goals. 
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We  shall  be  concerned  in  this  chapter  with  presenting  a 
theorem  proving  (problem  solving)  procedure  which  addresses  the 
logical  difficulties  of  the  problem  reduction  method  while 
retaining  its  form. 

The  necessity  for  presenting  contrapositives  is  retained  in 
this  theorem  proving  method. 

The  impossibility  of  proving  A from  B « A and  -<B  « A by  the 
problem  reduction  method  is  solved  by  modifying  the  single 
problem  reduction  method  inference  rule.  Instead  of  using  only 
originally  supplied  implications  to  generated  subgoals,  the 
negations  of  higher  subgoals  may  also  be  used. 

For  example,  in  the  problem  cited  above,  we  can  establish  B as 
a subgoal  of  A using  B ■»  A.  we  ceui  then  establish  ->A  as  a 
subgoal  of  B using  -A  * Q (the  contrapositive  of  ->B  « A)  . At 
this  point,  the  .ordinary  problem  reduction  method  would  fail 
since  *'A  occurs  as  the  consequent  of  no  implication.  However,  we 
permit  the  use  of  the  negations  of  higher  goals  as  premises. 

Thus,  applying  lA  (the  negation  of  the  top  goal  A)  and  acting  as 
if  -tA  were  an  immediately  solvable  goal,  we  "solve*  ^A  which 
"solves"  B which  solves  A. 

The  validity  of  this  form  of  reasoning  is  demonstrated  by 
showing  it  is  a form  of  reasoning  by  contradiction:  either  A is 
true  or  false;  if  A is  true,  then  the  top  goal  A is  true;  if  A is 
false,  then  (by  -lA  ♦ 3)  3 is  true  anJ  (by  3 ♦ A)  A is  true,  a 
contrailiction;  therefore,  A nust  be  trus.  ^ute  that  this 
reasoning  makes  no  statement  about  the  truth  or  falsity  of  B. 
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Given  the  implications  -lA  ♦ B and  B 
of  the  value  of  B. 


A,  A is  true  indepedently 


2.4.  iHerbrand's  theorem 

■In  the  propositional  calculus  (the  restriction  of  the  <first 
order  predicate  calculus  to  atomic  .formulas  which  have  0-ary 
predicate  symbols,  and  .consequent  elimination  of  quantifiers),  it 
is  easy  to  devise  a proof  procedure  which  can  prove  or  disprove 
any  formula.  An  example  of  such  a proof  procedure  is  Wang's 
algorithm  (52J  which,  given  any  formula,  will  respond  with  either 
"VALID*'  or  "NOT  VALID”  if  the  .formula  is  or  is  not  a theorem 
respectively,  and  will  present  a proof  or  disproof  of  the 
formula.  In  contrast  to  tnis  nice  situation  in  the  propositional 
calculus,  the  .first  order  predicate  calculus  is  undecidable 
meaning  that  there  is  no  procedure  which  can  determine  validity 
or  invalidity  of  all  formulas  in  finite  time.  Consequently,  the 
strongest  theorem  proving  procedure  .for  the  whole  of  the  first 
order  predicate  calculus  would  have  the  .following  properties: 

(1)  it  can  prove,  in  a .finite  number  of  steps,  any  valid  formula 
(completeness)  , 

(2)  it  cannot  prove  any  formula  which  is  invalid  (soundness),  and 

(3)  an  attempt  to  prove  a formula  which  is  invalid  may  result  in 
an  infinite  computation. 

Theorem  proving  procedures  based  on  Herbrand's  ♦•heorem  have 
these  properties.  Prior  to  stating  Herbrand's  theorem,  and 
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discussion  of  procedures  based  on  it,  it  is  necessary  to  present 
a series  of  definitions  of  standard  forms  of  formulas  referred  to 
in  the  statement  of  Herbrand's  theorem. 

The  pisQSS  iSIlS  of  a formula  is  the 
logically  equivalent  formula  in  which  all  quantifiers  appear  at 
the  beginning  of  the  formula.  More  precisely,  a formula  A is 
said  to  be  in  prenex  form  if: 

(1)  it  is  in  the  form  Qx2...Qx^B  where 

(2)  each  Qx^  (l^i<n)  is  either  Vxjj^  or  3x^  where  x^  is  a variable, 
and 

(3)  B is  an  open  formula,  i.e.,  B contains  no  quantifiers. 

The  list  of  quantifiers  Qx^...Qx^  is  called  the  prefix  of  A 

and  B is  called  the  matrix  of  A. 

ixi*  (2)  is  the  prenex  form  of  (1). 

(1)  -iVy  [Fyx  -i^z  ( Fyz  a Fzy)  J 

(2)  3yVziw((Fyx  A Fyw  a Fwy)  v (-iFyx  a (-*Fyz  v -iFzy)  ) ) 

24.0*  The  closure  of  a formula  A in  prenex  form 
with  prefix  Qxj^ , . . . , QXjj  and  variables  Xj^ , . . . ,Xj^,yj^, . . . .y^^  is 
Vy^. . •VyjnQxj^. . .QXpB  where  B is  the  matrix  of  A.  A closed  formula 
in  prenex  form  represents  the  standard  form  of  a formula  with 
respect  to  its  quantifiers.  (3)  is  the  closure  of  (2),  and  is 
(1)  in  closed  prenex  form. 

(3)  Vx3yVz3wl  (Fyx  a Fyw  a Fwy)  v (-»Fyx  a (nFyz  v -»Fzy)  ) ] 

92£iDi-i2D  2i22*  §iS2iS2  l2Q£ii2Dfi  l2£!!!  ® formula  is  a 
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special  form  in  which  all  universal  quantifiers  have  been 
eliminated.  Universally  quantified  variables  are  replaced  by 
"Skolem  functions"  of  existentially  quantified  variables 
specially  created  for  this  purpose.  Let  A be  a formula  in  closed 
prenex  form.  The  Skolem  functional  form  of  A,  denoted  by  Ay,  is 
defined  as  follows: 

(1)  If  A is  existential  (has  only  existential  quantifiers)  then 
A^  is  just  A. 

(2)  Otherwise,  since  A is  in  prenex  form  and  must  have  at  least 
one  universal  quantifier,  it  must  be  of  the  form  3Xj...3XpVyB 
where  B is  a (not  necessarily  open)  formula.  A^  is  then  the 
Skolem  functional  form  of  A*  where  A*  is  defined  to  be 

ixj. . .5x^8.  {y..g  (x^, . . . ,x„)  } . {y ..g  ( x^ , . . . , x^)  } is  a substitution 
which  replaces  all  occurrences  of  y by  g ( x^^ » • • • * x^)  where  g is  a 
specially  created,  unique  n-ary  Skolem  function. 

Thus  the  procedure  for  generating  the  Skolem  functional  form 
of  A is  to  generate  the  finite  sequence  of  formulas  A,  A*,  A**, 
....  Each  element  of  the  sequence  has  one  fewer  universal 
quantifiers  than  its  predecessor;  the  last  element  A^  is 
existential . 

Example  These  are  the  steps  in  the  creation  of  the 

Skolem  functional  form  of  (3).  (3.1)  is  (3)*;  (4)  is  (3.1)*  or 

(3) **  and  is  the  Skolem  functional  form  of  (3). 

(3.1)  3yVz3w(  (Fya  a Fyw  a Fwy)  v (-«Fya  a (-»Fyz 'v  -»Fzy)  ) J 

(4)  3y3wl(Fya  a Fyw  a Fwy)  v (-.Fya  a (-iFyg(y)  v -.Fg(y)y))] 

The  Skolem  functions  that  were  introduced  are  the  0-ary 
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function  a which  replaced  x and  the  unary  function  g which 
replaced  w. 


Note  that  due  to  the  non-uniqueness  of  the  closed  prenex  form, 
the  Skolero  functional  form  is  not  generally  unique.  If,  during 
the  formation  of  the  closed  prenex  form  of  the  formula,  one 
strives  to  place  universal  quantifiers  as  close  to  the  beginning 
of  the  quantifier  list  as  possible,  the  resulting  Skolem 
functional  form  will  look  simpler,  having  Skolem  functions  with  a 
smaller  number  of  arguments. 

It  is  also  possible  to  generate  the  Skolem  functional  form  of 
a formula  without  initially  putting  the  formula  into  prenex  form. 

This  is  discussed  in  Section  Z.  8. 

?!JS2£S!S  2j,1.  (Herbrand's  theorem,  version  1)  The  closed 
formula  A in  prenex  form  is  a theorem  if  and  only  if  a 
disjunction  of  ground  instances  of  the  matrix  of  is  a 
tautology. 

To  say  that  a formula  is  a tautology  means  that  there  is  no 
assignment  of  truth  values  to  ground  instances  of  atoms  for  which 
the  formula  is  false,  i.e.,  the  formula  is  invariably  true  under 
all  possible  truth  value  assignments. 

Since  most  automatic  theorem  proving  programs  actually  prove 
theorems  by  refuting  their  negations,  the  statement  of  Herbrand's 

j 

theorem  they  use  is  based  on  the  notion  of  unsatisfiability.  A 
formula  is  unsati sf iable  if  there  does  not  exists  an  assignment 
of  truth  values  to  ground  instances  of  atoms  in  the  formula  for 
which  the  formula  is  true.  In  contrast  to  a tautology  which  is 
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always  true,  an  unsatisfiable  formula  is  always  false.  A truth 
functionally  unsatisfiable  formula  is  a ground  formula  which  is 
unsatisfiable.  Since  a formula  is  a theorem  if  and  only  if  its 
negation  is  unsatisfiable,  Herbrand's  theorem  can  now  be  stated 
as  the  following. 

2^2.  (Herbrand's  theorem,  version  2)  The  closed 
formula  A in  prenex  form  is  a theorem  if  and  only  if  a 
conjunction  of  instances  of  the  matrix  of  -lAjj  (the  negation  of 
its  Skolem  functional  form)  is  truth  functionally  unsatisfiable. 

Note  that  the  usage  of  -*Ay  (the  negation  of  the  Skolem 
functional  form)  suggests  an  alternate  method  of  preparing  a 
formula  for  application  of  Herbrand's  theorem.  The  formula  may 
be  negated  first;  then  transformation  to  prenex  form  and 
substitution  of  Skolem  functions  for  variables  can  be  done. 

Since  negating  the  formula  changes  universal  quantifiers  into 
existential  quantifiers  and  vice  versa,  the  substitution  may  be 
accomplished  by  replacing  existentially  (rather  than  universally) 
quantified  variables  with  Skolem  functions  of  universally 
quantified  variables.  For  example,  the  Skolem  functional  form  of 
the  (already  negated)  formula  VxByPxy  is  VxPxg(x).  The  intuitive 
interpretation  of  the  term  g(x)  is  that  it  is  the  function 
providing  for  any  x the  y for  which  Pxy  is  true.  Further,  this 
approach  more  compellingly  illustrates  the  fact  that  the  final 
prefix  always  consists  solely  of  universal  quantifiers.  Thus  all 
variables  in  the  matrix  may  be  regarded  as  being  implicitly 
universally  quantified,  and  the  prefix  discarded.  This  way  of 
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generating  the  input  form  for  Herbrand's  theorem  is  the  one  most 
used  and  described  in  theorem  proving  applications,  although  it 
differs  from  the  historical  approach  described  earlier. 

In  order  to  use  the  above  definitions  for  proving  theorems,  it 
is  necessary  to  define  the  set  of  terras  which  can  be  used  in 
creating  the  needed  instances.  This  set  of  terms  is  defined 
below. 


PSliOiiiSO  2j,21.  The  Herbrand  universe  for  formula  A consists 
of 

(1)  the  variables  of  Ay,  and 

(2)  terras  constructed  of  n-ary  function  symbols  of  A^  followed  by 
n terras  of  the  Herbrand  universe  of  Ay. 

This  definition  of  the  Herbrand  universe  is  actually  an 
extremely  important  concept.  It  means  essentially  that  all  the 
vocabulary  necessary  to  mechanically  prove  a theorem  is  contained 
in  the  statement  of  the  theorem — there  is  no  cause  for  resorting 
to  the  use  of  additional  symbols. 

Note  that  if  Ay  contains  at  least  one  function  symbol  with 
degree  greater  than  zero,  the  Herbrand  universe  has  an  infinite 
number  of  elements.  This  precludes  the  possibility  of 
enumerating  a finite  number  of  terms  and  testing  the  validity  of 
a formula  with  each  combination  of  values  of  (a  finite  number  of) 
ground  instances  of  the  formula.  This  is  the  feature  of  the 
first  order  predicate  calculus  which  may  ultimately  lead  a 
theorem  prover  into  an  infinite  computation  when  trying  to  prove 
a formula  which  is  not  a theorem. 
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The  Hecbrand  universe  of  (4)  is  the  set 
ly»w,a,g{y) ,g(w) ,g(a) ,g(g(y) ) 

So  far  in  preparing  the  theorem  to  be  proved  for  application 
of  Herbrand's  theorem,  the  quantifiers  and  variables  have  been 
manipulated,  but  the  basic  structure  of  the  matrix  has  been  left 
unchanged.  Putting  the  matrix  into  a standard  form  will 
facilitate  checking  of  truth  functional  unsatisfiability. 

PSliOition  2Li.22*  The  conjunctive  normal  fdtm  of  the  matrix  of 
a formula  is  the  logically  equivalent  formula  which  is  a 
conjunction  of  clauses. 

The  matrix  of  a formula  is  in  conjunctive  normal  form  if  it 
has  the  form  v ...  v a ...  a v ...  v ^nn^,^  where 

each  L^j  is  a literal.  The  matrix  of  any  formula  can  be 
transformed  into  a (generally  non-unique)  conjunctive  normal 
form. 

Example  2^5.  This  is  the  generation  of  the  conjunctive  normal 
form  of  (4.1),  the  negation  of  (4).  (4.2)  and  (4.3)  are 

intermediate  steps,  and  (5)  is  the  final  result  of  all 
preparation  for  applying  Herbrand's  theorem:  the  conjunctive 
normal  form  of  the  negation  of  the  Skolem  functional  form  of  (1). 

(4.1)  -«3z3w((Fya  a Fyw  a Fwy)  v 

(iFya  A (-«Fyg(y)  v -iFg(y)y))l 

(4.2)  V2Vw-i((Fya  a Fyw  a Fwy)  v 

(-iFya  A (-<Fyg(y)  v -•Fg(y)y))] 

(4.3)  V2VwI(-iFya  v -iFyw  v .»Fwy)  a 

(Fya  V (Fyg(y)  a Fg(y)y))] 
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(5)  V2Vwl(->Fya  v -iFyw  v -iPwy)  a 
(Fya  V Fyg(y))  a 
(Fya  V Fg(y)y)  1 

Besides  being  notationally  convenient,  putting  -«A  into 
conjunctive  normal  form  has  an  extremely  important  advantage. 

This  results  from  the  fact  that  a conjunction  is  truth 
functionally  unsati sf iable  if  and  only  if  a (perhaps  improper) 
subset  of  the  set  of  conjuncts  is  truth  functionally 
unsatisf iable.  Thus,  Herbrand's  theorem  can  now  be  stated  in  the 
final  form  in  which  it  is  customarily  used. 

Theorem  2^3.  (Herbrand's  theorem,  version  3)  The  closed 
formula  A in  prenex  form  is  a theorem  if  and  only  if  a 
conjunction  of  instances  of  the  clauses  of  nAu  is  truth 
functionally  unsati sfiable. 

Example  2^6.  This  is  a demonstration  of  the  truth  functional 
unsatisfiability  of  the  conjunction  of  a set  of  instances  of  the 
clauses  of  (5).  According  to  Herbrand's  theorem,  this 
constitutes  a proof  of  (1). 

I -•Faa  is  an  instance  of  -«Fya  v ^pyw  v -«Fwy  obtained  by 
replacing  both  y and  w by  a, 

II  -•Fag(a)  v -.Fg(a)a  is  also  an  instance  of  -*Fya  v -»Fyw  v -*Fwy 
obtained  by  replacing  y by  g(a)  and  w by  a, 

III  Faa  V Fag (a)  is  an  instance  of  Fya  v Fyg(y)  obtained  by 
replacing  y by  a,  and 

IV  Faa  V Fg(a)a  is  an  instance  of  Fya  v Fg(y)y  obtained  by 
replacing  y by  a. 
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That  Faa  must  be  false  can  be  .inferred  -from  .1;  therefore,  by  .III 
and  iIV,  Fag  (a)  and  Fg(a)a  must  both  be  true.  But  under  this 
assignment,  clause  .II  is  assigned  the  value  -false,  and  the  entire 
conjunction  is  assigned  the  value  -false.  Thus,  the  conjunction 
of  .1,  II,  -III,  and  iIV  is  unsati sfiable.  The  same  result  can  be 
obtained  by  examining  the  truth  table  which  considers  al-1 
possible  assignment  of  truth  values  to  the  atoms  of  clauses  .1, 

II,  ILI,  and  -IV.  The  value  of  the  conjunction  of  these  -four 
clause  is  tabulated  below  for  each  possible  truth  value 
assignment. 

Faa Facial Ssiala ^.I ^^11 ^^III ^^ly 

~f  T ~ T ~ F F T T F 

TTF  FTTTF 

TFTF  TTTF 

TFFFTTTF 
FTTTFTTF 
FTFTTTFF 
FFTTTF  .TF 

F F F T T F F F 

The  .earliest  applications  of  Herbrand's  theorem  to  automatic 

theorem  proving  involved  -Level  saturation  [421.  .This  consists  of 

generating  a sequence  CqCc^c Cj,*:. ...  of  sets  of  instances  of 

clauses  in  the  matrix  of  the  .Skolem  -functional  -form  of  the 

-formula  to  be  proved.  Each  set  is  checked  to  determine  if  it  is 

truth  -functionally  unsati  sfiable.  -If  so,  the  procedure  is 

terminated  and  the  -formula  is  a theorem.  Otherwise,  the  next  set 

in  the  sequence  is  generated.  .Should  the  next  set  be  equal  to 

the  current  set  (no  new  clause  instances  were  added)  the  -formula 

is  not  a theorem.  For  most  cases,  however,  (specifically, 

whenever  there  is  at  least  one  variable  and  one  non-constant 
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[ 


1 

function  symbol) , non-theorems  will  result  in  an  infinite 
sequence  of  sets  rather  than  this  termination. 

The  criterion  commonly  used  to  determine  which  instances  are 
to  be  generated  as  new  instances  of  each  set  is  the  complexity  of 
the  instances.  Thus  would  contain  all  clauses  with  complexity 
measure  less  than  or  equal  to  n.  The  most  common  measure  of 
complexity  of  a term  is  its  function  depth,  a measure  of  the 
nesting  of  function  symbols  in  the  term.  For  a constant  or  ! 

variable,  the  function  depth  is  defined  to  be  zero.  The  function 
depth  of  a term  composed  of  an  n-ary  function  symbol  (n>0)  is 
defined  to  be  one  plus  the  maximum  of  the  function  depths  of  the 
arguments.  For  example,  f(g(x)a)  has  function  depth  2 and  f(a) 
has  function  depth  1.  Function  depth  can  be  easily  computed  by 
counting  the  maximum  number  of  nested  pairs  of  parentheses. 

The  rule  for  defining  the  set  could  then  be:  is  defined 

to  be  the  set  of  all  instances  of  clauses  in  the  matrix  of  the 
Skolem  functional  form  of  the  formula  to  be  proved  formed  by  l 

using  terms  of  the  Herbrand  universe  with  function  depth  less 
than  or  equal  to  i in  the  substitutions. 

The  level  saturation  procedure  takes  its  name  from  the  fact 
that  each  successive  set  in  the  sequence  represents  a higher 
level  of  complexity  of  clause  instances,  and  each  set  contains 
all  instances  of  a specified  complexity  level. 

This  procedure  is  not  at  all  practical  for  interesting 
problems  due  to  the  rapid  growth  rate  in  the  number  of  clauses 
that  must  be  generated  and  considered  as  the  complexity  level 
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increases.  The  procedure  demands  that  all  instances  less  complex 
than  the  most  complex  clauses  required  in  the  proof  be  generated, 
and  the  most  complex  clause  requited  in  the  proof  is  frequently 
quite  complex. 


2.5.  Resolution  theorem  proving  procedure 

The  majority  of  automatic  theorem  proving  programs  for  the 
first  order  predicate  calculus  are  based  on  the  resolution 
principle  introduced  by  J.  A.  Robinson  [41].  The  resolution 
principle  is  fundamentally  based  on  a rule  of  reasoning  that  the 
clause  A V B can  be  inferred  from  the  conjunction  of  the  clauses 
A V c and  A v -»C.  A v B is  called  a resolvent  of  the  other  two 
clauses  which  are  called  parent  clauses  of  A v b. 

We  will  define  resolution  in  terms  of  two  operations:  binary 
resolution  and  factorization. 

2.5.1.  Binary  resolution.  Let  K and  L be  two  clauses.  Let  K* 
and  L'  be  variants  of  K and  L respectively  such  that  K’  and  L' 
have  no  variables  in  common.  If  C is  a literal  of  K',  D is  a 
literal  of  L',  and  Cg  is  tD,  or  Dg  is-»Cg  for  some  unifying 
substitution  a then  (K'-C)o  (I*’”D)g  is  a binary  resolvent  of  K 

and  L where  (K'-C)  is  the  clause  K'  with  C removed  and  (L'-D)  is 
the  clause  L*  with  D removed.  This  is  called  binary  resolution 
in  contrast  to  other  resolution  operations  since  exactly  two 
literals  are  involved  in  the  unification  process:  one  from  K and 
one  from  L. 
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We  will  show  a schematic  representation  for  various 
operations.  The  format  will  be  the  inputs  followed  by  a 
horizontal  line  followed  by  the  output.  q,  0,  y,  and  j will 
denote  sequences  or  disjunctions  of  literals;  C and  D will  denote 
single  literals.  The  resolution  operation  can  be  represented  as: 

input  clausej^  a v C v 8 
input  clause2  tt  v -C  v 6 

resolvent  a v b v y v «. 

2.5.2.  Binary  factorization.  Let  K be  a clause.  If  C and  D 
are  separate  literals  of  K and  Co  is  Do  for  some  unifying 
substitution  o then  (K-D)o  is  a binary  factor  of  K.  Similarly  to 
binary  resolution,  this  is  called  binary  factorization  since 
exactly  two  literals  are  involved  in  the  unification  process. 

This  can  be  expressed  schematically  as 

input  clause  ovCvbvcvy 
factor  o V c V B V Y. 

More  general  resolution  and  factorization  operations  can  be 
defined.  Factorization  can  be  defined  to  permit  an  arbitrary 
subset  of  the  literals  of  a clause  to  be  unified  together  rather 
than  just  two.  Resolution  can  be  defined  to  permit  an  arbitrary 
subset  of  each  clause  to  be  unified  against  an  arbitrary  subset 
of  the  other  clause.  Each  of  these  more  general  operations  can 
be  readily  defined  in  terms  of  their  binary  counterparts. 

2.5.3.  Raw  resolution.  The  derivation  of  the  empty  clause 
from  a set  of  clauses  by  the  resolution  procedure  constitutes  a 
proof  of  the  unsatisfiability  of  the  set  of  clauses.  We  will 
call  raw  resolution  the  procedure  which  employs  binary  resolution 
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and  factorization  operations  with  no  restrictions  on  the  clauses 
they  operate  upon. 

2*1. 

1.  .tFya  -.Fyw  -«Fwy 

2.  Fya  Fyg(y) 

3.  Fya  Fg(y)y 

4.  -iFya  ->Fay  factor  l(a,b) 

5.  -iFaa  factor  4 

6.  Fag (a)  resolve  2(a)  and  5 

7.  Fg(a)a  resolve  3(a)  and  5 

8.  -«Fag(a)  resolve  4(a)  and  7 

9.  Q resolve  6 and  8 

Numerous  refinements  of  raw  resolution  have  been  developed  to 
reduce  the  number  of  possible  inference  operations  that  can  be 
performed  at  any  given  point  and  thus  reduce  the  number  of 
derived  clauses. 

2.5.4.  Tautology  elimination.  Tautology  elimination  permits 
the  procedure  to  eliminate  all  clauses  which  contain  two  literals 
which  are  complementary  to  each  other  and  are  thus  necessarily 
true  under  any  assignment  of  truth  values.  Tautology  elimination 
is  compatible  with  most  though  not  all  refinements  of  resolution. 

2.5.5.  Subsumption.  Clause  C subsumes  clause  D if  each 
literal  of  Ce  is  a literal  of  0 for  some  substitution  e.  Since  D 
is  true  whenever  C is  (C  being  more  general) , there  is  some 
justification  for  believing  that  D can  be  thrown  away  during  the 
search  for  a proof.  The  subsumption  refinement  permits  the 
elimination  of  newly  generated  clauses  if  a previously  generated 
clause  subsumes  it  (forward  subsumption)  and  the  elimination  of 
previously  generated  clauses  if  newly  generated  clause  subsumes 
it  (backward  subsumption).  Subsumption  in  some  form  is 
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compatible  with  many  refinements  of  resolution.  Two  major 
difficulties  may  occur  in  the  arbitrary  addition  of  the 
subsumption  refinement  to  an  otherwise  incomplete  procedure.  The 
first  concerns  the  compatibility  of  backward  subsumption  with  the 
search  strategy  being  employed.  If  a newly  generated  clause  is  a 
variant  (identical  except  for  variable  renaming)  of  a previously 
generated  clause  and  the  latter  is  eliminated  by  backward 
subsumption,  the  search  strategy  may  schedule  inference 
operations  to  be  performed  (on  the  newly  generated  chain)  in  such 
a way  that  an  infinite  sequence  of  variants  of  the  same  clause 
may  be  generated,  each  variant  eliminating  its  predecessor  and 
having  a lower  priority  for  being  operated  upon.  This  problem 
can  generally  be  overcome  by  always  checking  for  forward 
subsumption  before  backward  subsumption  thereby  always  retaining 
the  first  rather  than  the  most  recent  variant.  The  second  major 
difficulty  concerns  the  general  incompatibility  of  subsumption 
with  some  refinements.  For  example,  one  may  get  into  trouble 
using  subsumption  with  linear  refinements  since  the  ancestry  of  a 
clause  is  important  in  determining  what  clauses  it  can  be 
resolved  against.  Thus,  one  clause  could  subsume  another  with  a 
different  ancestry,  yet  the  subsumed  eliminated  clause  may  be 
essential  to  a proof  by  virtue  of  its  ancestry. 

2.5.6.  Set  of  support  strategy.  Let  S be  an  unsatisfiable  set 
of  clauses  and  T be  a subset  of  S such  that  S-T  is  satisfiable. 
The  set  of  support  refinement  dictates  that  no  members  of  S-T 
need  be  resolved  against  each  other.  Equivalently,  the  set  of 


support  refinement  dictates  that  every  resolvent  must  have  as  an 
ancestor  a member  of  T.  This  is  a quite  effective  restriction  on 
raw  resolution.  Generally,  a theorem  is  posed  in  terms  of  a 
neqated  conclusion,  some  special  hypotheses,  and  a set  of  axioms 
(e.g.,  axioms  for  groups  or  fields).  The  set  T is  often  taken  to 
be  just  the  negated  theorem  or  the  negated  theorem  plus  the 
special  hypotheses.  The  set  of  axioms  is  assumed  to  be 
consistent,  and  no  deductions  are  made  from  the  set  of  axioms 
without  referring  somewhere  in  the  deduction  to  clauses  in  the 
set  of  support,  i.e.,  the  negated  theorem  plus  special 
hypotheses.  This  makes  raw  resolution  somewhat  more  goal 
directed  since  without  the  set  of  support  refinement,  raw 
resolution  (among  other  things)  is  free  to  draw  all  conclusions 
from  the  set  of  axioms  without  referring  to  the  problem  at  hand. 

2.5.7.  Input  resolution.  Input  resolution  is  an  incomplete 
refinement  of  resolution  in  which  one  parent  of  each  resolvent  is 
an  input  clause. 

PSfi2i£i2G  Zx2.i-  An  ioBUt  £S§2ly£i2D  clause  C 

from  set  of  clauses  S is  a sequence  of  clauses  Cj^,...,Cn  such 
that 

(1)  (l£i<m)  is  a member  of  S, 

(2)  (m<i£n)  is  a binary  resolvent  of  a. factor  of  and  a 

factor  of  Cj  (l<j<m) , and 

(3)  C„  is  C. 

An  input  xfiSfilytiflD  from  set  of  clauses  S is  an 

input  resolution  deduction  of  the  empty  clause  □ from  S. 


Following  is  an  input  resolution  refutation  from 
(5).  Each  operation  in  the  proof  including  the  formation  of 
factors  as  required  in  the  definition  is  explicitly  shown. 


1. 

-iFya  -•Fyw  -»Fwy 

2. 

Fya  Fyg(y) 

3. 

Fya  Fg(y)y 

1 

4. 

->Fya  -*Fay 

factor 

1 (a 

,b) 

] 

1 

5. 

-iFaa 

factor 

4 1 

6. 

Fg  ( a)  a 

resolve 

3( 

a) 

and  5 

7. 

iFag (a) 

resolve 

6 

and 

4 (a) 

8. 

Faa 

resolve 

7 

and 

2(b) 

9. 

□ 

resolve 

8 

and 

5 

2.5.8.  Unit  resolution.  Unit  resolution  is  an  incomplete 


refinement  of  resolution  in  which  one  parent  of  each  resolvent  is 


a unit  clause. 


A uni£  resolution  deduction  of  clause  C from 
set  of  clauses  S is  a sequence  of  clauses  such  that 

(1)  (l£i£m)  is  a member  of  S or  a unit  factor  of  a member  of 

S, 


i (2)  (m<i£n)  is  a binary  resolvent  of  Cj  (l_<j<i-l)  and  Cj^ 

(l£k<j-l)  where  either  Cj  or  Cj^  is  a unit  clause,  and 
i (3)  Cjj  is  C. 

A unit  xS5Sly£i2D  IS£y&2£l2D  from  set  of  clauses  S is  a unit 

! resolution  deduction  of  the  empty  clause  Q from  S. 

I 
I 

Example  Following  is  a unit  resolution  refutation  from 


(5)  . 

1.  -•Fya  -«Fyw  -tFwy 

2.  Fya  Fyg(y) 

3.  Fya  Fg(y)y 

4.  -iFaa 

unit  factor  of  1 

5.  Fag (a) 

resolve  4 

and  2 (a) 

6.  Fg(a)a 

resolve  4 

and  3 (a) 

resolve  6 and  1 (a) 
resolve  7 (a)  and  6 
resolve  8 and  5 


7.  -iFg(a)w  -*Fwg(a) 

8.  -iFag(a) 


Q 


2.5.9.  Linear  resolution.  Linear  resolution  is  a complete 
refinement  of  resolution  in  which  one  parent  of  each  resolvent  is 
either  an  input  clause  or  an  ancestor  of  the  other  parent. 


A lloeai:  xssQlutiQo  i^educLiao  of  clause  c 
from  set  of  clauses  S is  a sequence  of  clauses  such 

that 

(1)  (l<i<m)  is  a member  of  S, 

(2)  (m<i£n)  is  a binary  resolvent  of  a factor  of  and  a 

factor  of  Cj  (l<j<i-l),  and 

(3)  C„  is  C. 

^ £SSSly£iSD  from  set  of  clauses  S is  an 

linear  resolution  deduction  of  the  empty  clause  Q from  S.  ■ 


S225}Bi£  2^10.  Following  is  a linear  resolution  refutation 
from  (5).  Each  operation  in  the  proof  including  the  formation  of 
factors  as  required  in  the  definition  is  explicitly  shown. 


1.  -«Fya  -*Fyw  -tFwy 

2.  Fya  Fyg(y) 

3.  Fya  Fg{y)y 

4.  Faa  -'Fg(a)w  •»Fwg(a) 

5.  Faa  -«Fg(a)a  Faa 

I 6.  Faa  -•Fg  ( a)  a 

7.  Faa  Faa 

[ 8.  Faa 

9.  -iFaa  -'Faa 

10.  -»Faa 

11.  0 


resolve  3(b)  with  1(a) 
resolve  4(c)  with  2(b) 
factor  5(a,c)  ] 
resolve  6(b)  with  3(b) 
factor  7 J 
resolve  8 with  1(b) 
resolve  9 with  3 
resolve  10  with  8 
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2.5.10.  Paramodulation 


Paramodulation  [56]  is  an  inference 


operation  in  addition  to  resolution  designed  to  facilitate 
equality  inferences. 

Let  K and  L be  two  clauses.  Let  K*  and  L*  be  variants  of  K 
and  L respectively  such  that  K'  and  L’  have  no  variables  in 
common.  If  C is  a literal  of  K ' , D is  a literal  of  L',  C (or  D) 
is  an  equality  literal  of  the  form  a*b  or  b»a,  and  the  other 
literal  contains  a term  unifiable  with  a for  some  unifying 
substitution  g,  then  {K'-C)o  v (L'-D)o  v Eo  where  E is  D (or  C) 
with  a single  instance  of  the  term  unified  with  a replaced  by  b 
is  a paramodulant  of  K and  L. 

This  can  be  expressed  schematically  as 

input  clausej^  o v c v g 
input  clause2  7 v d v 6 
where  C is  a=b  or  b»a 


paramodulant  avBVfVjvD* 
where  D'  is  D with  a single  a replaced  by  b. 

2.6.  Model  elimination  theorem  proving  procedure 

The  model  elimination  theorem  proving  procedure  was  developed 
by  Loveland  129).  The  strong  model  elimination  and  model 
elimination  with  paramodulation  refinements  were  also  developed 
by  Loveland  in  [30]  and  [31]  respectively.  We  will  take  some 
liberties  in  presenting  model  elimination  in  slightly  different 
fashion  from  Loveland. 

In  contrast  to  resolution,  the  fundamental  element  of  a model 
elimination  deduction  is  the  chain.  A chain  is  an  ordered 
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sequence  of  literals.  Two  types  of  literals  are  distinguished: 

B-litorals.  B-literals  correspond  to  the  literals 
present  in  clauses  in  resolution  theorem  proving.  A-literals 
record  ancestry  information  and  represent  assignments  of  truth 
values  to  literals  already  resolved  upon.  Model  elimination  is 
related  to  linear  refinements  of  resolution,  but  since  some 
ancestry  information  is  retained  in  the  chains  it  operates  upon, 
model  elimination  is  a complete  input  procedure  in  contrast  to 
the  incomplete  input  resolution  operating  on  clauses  rather  than 
chains . 

The  input  to  the  model  elimination  procedure  as  with 
resolution  is  a set  of  clauses.  One  of  the  clauses  is  designated 
to  be  the  top  chain  of  the  refutation  (the  set  of  support)  and  is 
converted  to  a chain  by  merely  selecting  an  ordering  for  its 
literals  and  fixing  that  order. 

PSfiSi£i2D  a mogsl  elimination  JMEi  deduction  of  chain 

K from  set  of  clauses  S is  a sequence  Kq,...,Kj^  of  admissable 
chains  such  that 

(1)  Kq  is  the  top  chain  derived  from  a member  of  S, 

(2)  is  K,  and 

(3)  (l£i<n)  is  derived  from  by  extension  by  a member  of 

S,  or  by  reduction. 

^ ISlytatign  is  a ME  deduction  of  the  empty  chain  0. 


2.6.1.  Adnissability.  An  admissable  chain  is  a chain  in  which 


f 


I 


I 


(1 ) no  B-literal  is  followed  by  a complementary  B-literal  or 
A-literal  unless  they  are  separated  by  at  least  one  A-literal, 

(2)  no  A-literal  is  followed  by  an  identical  B-literal  or 
A-literal , 

(3)  no  A-literal  is  followed  by  a complementary  A-literal,  and 

(4)  the  last  literal  of  the  chain  is  not  an  A-literal. 

A chain  is  termed  P£eadmissable  if  it  satisfies  requirements 
(1),  (2),  and  (3). 

2.6.2.  Extension.  The  extension  operation  takes  an  admissable 

chain  K and  an  input  clause  C as  its  arguments.  Let  K*  and  C be 

variable  disjoint  variants  of  K and  C.  If  the  last  literal  of  K' 

is  unifiable  with  the  complement  of  a literal  of  C,  the  chain 

consisting  of  K'  followed  by  the  remaining  literals  of  C with 

unifying  substitution  applied  is  formed.  Each  literal  of  the 

derived  chain  descended  from  a literal  of  K*  is  designated  to  be 

the  same  type  of  literal  as  its  ancestor  except  the  last  which  is 

designated  to  be  an  A-literal;  each  literal  of  the  derived  chain 

descended  from  a literal  of  C is  designated  to  be  a B-literal. 

If  the  derived  chain  is  preadmissable,  the  admissable  chain  with 

all  terminal  A-literals  deleted  is  returned;  no  chain  is  returned 

if  the  derived  chain  is  not  preadmissable. 

This  can  be  expressed  schematically  as 

input  chain  o Cj 
input  clause  v ...  v 

derived  chain  a {C^]  •••  ^i+1  •••  ^n* 


50 


2.6.3.  Reduction.  The  reduction  operation  takes  an  admissable 
chain  K as  its  argument.  l£  the  last  literal  of  K is  unifiable 
with  the  complement  of  a preceding  A-literal  of  K,  the  chain 
consisting  of  K with  the  last  literal  deleted  and  with  unifying 
substitution  applied  is  formed.  Each  literal  of  the  derived 
chain  is  designated  to  be  the  same  type  of  literal  as  its 
ancestor.  If  the  derived  chain  is  preadmissablC/  the  admissable 
chain  with  all  terminal  A-literals  deleted  is  returned;  no  chain 
is  returned  if  the  derived  chain  is  not  preadmissable. 

This  can  be  expressed  schematically  as 

input  chain  a [C]  6 


derived  chain  a [C]  b. 

ME  is  essentially  input  resolution  with  (1)  ordered  derived 
clauseSf  (2)  retention  of  resolved  upon  literals  as  A-literals, 
(3)  absence  of  the  factorization  operation,  and  (4)  presence  of 
the  reduction  operation.  Unlike  the  input  resolution  procedure, 
ME  is  complete  in  spite  of  the  absence  of  the  factorization 
operation  required  by  input  resolution. 

Examgle  2.11. 


1. 

-•Fya 

-iFyw  -•Fwy 

2. 

Fya 

Fyg(y) 

3. 

Fya 

Fg(y)y 

4. 

Faa 

[Fg(a)  a] 

-•Fg  ( a) 

w 

-Fwg  ( a) 

ex  tend 

by 

1(a) 

5. 

Faa 

IFg(a)  a] 

-Fg  ( a) 

a 

[-Fag (a) ] 

Faa 

ex  tend 

by 

2(b) 

6. 

Faa 

[Fg  (a)  a] 

-•Fg(a) 

a 

(-Fag (a) ] 

[Faa] 

-•Faa 

-•Faa 

extend 

by 

Kb) 

7. 

Faa 

(Fg  (a)  a] 

-•Fg  (a) 

a 

[-Fag (a) ] 

[Faa] 

-«Faa 

reduce 

(e, 

•g) 

S. 

Faa 

(Fg(a)  a] 

-Fg(a) 

a 

red  uce 

(e, 

,f) 

St. 

Faa 

r ed  uc  e 

(b, 

rC) 

10. 

[Faa]  -tFaa 

-Faa 

extend 

by 

Kb) 

11. 

[Faa]  -iFaa 

red  uce 

(a, 

rC) 

12. 

0 

reduce 
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2.6.4.  Strong  model  elimination.  Addition  of  the  equivalent 
of  the  factorization  operation  in  the  resolution  procedure 
results  in  strong  n)252§l  £lilDiQ2£i2Q*  Strong  model  elimination  is 
so  named  not  because  it  is  necessarily  more  powerful  (probably 
the  contrary  is  true),  but  because  the  chain  rejection  criteria 
corresponding  to  the  model  elimination  admissabil ity  requirement 
are  stronger. 

PSliniSiSG  A strong  ipodel  eiiiSiDation  Jstrgng_MEj. 

^§dy££i2D  of  chain  K from  set  of  clauses  S is  a sequence 
Kq,...,Kj^  of  acceptable  chains  such  that 

(1)  Kq  is  the  top  chain  derived  from  a member  of  S, 

(2)  Kjj  is  K,  and 

(3)  {l£i<_n)  is  derived  from  by  extension  by  a member  of 

S,  by  JEg2u££i2D»  by  factorization. 

A strong  {JE  rsf.Utatl2D  is  a strong  ME  deduction  of  the  empty 
chain 

2.6.5.  Acceptability.  An  acceptable  chain  is  a chain  in  which 

(1)  no  B-literal  is  followed  by  a complementary  B-literal  or 
A-literal , 

(2)  no  A-literal  is  followed  by  an  identical  B-literal  or 
A-literal , 

(3)  no  A-literal  is  followed  by  a complementary  A-literal, 

(4)  no  B-literal  is  followed  by  an  identical  A-literal,  and 

(5)  the  last  literal  of  the  chain  is  not  an  A-literal. 

A chain  is  termed  preaccegtable  if  it  satisfies  requirements 
(1),  (2),  (3),  and  (4). 
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2.6.6.  Extension.  The  strong  ME  extension  operation  is  the 
same  as  for  ME  except  it  operates  on  and  produces  acceptable 
rather  than  admissable  chains. 

2.6.7.  Reduction.  The  strong  ME  reduction  operation  is  the 
same  as  for  ME  except  it  operates  on  and  produces  acceptable 
rather  than  admissable  chains. 

2.6.8.  Factorization.  The  factorization  operation  takes  an 
acceptable  chain  K as  its  argument.  If  the  last  literal  of  K is 
unifiable  with  a preceding  B-literal  of  K,  the  chain  consisting 
of  K with  the  last  literal  deleted  and  with  unifying  substitution 
applied  is  formed.  Each  literal  of  the  derived  chain  is 
designated  to  be  the  same  type  of  literal  as  its  ancestor.  If 
the  derived  chain  is  pr eacceptable , the  acceptable  chain  with  all 
terminal  A-literals  deleted  is  returned;  no  chain  is  returned  if 
the  derived  chain  is  not  pr eacceptable. 

This  can  be  expressed  schematically  as 

input  chain  ^ C g C 
derived  chain  a C g. 

The  addition  of  the  factorization  operation  makes  strong  ME 
correspond  more  closely  to  input  resolution.  The  chain 
acceptability  criteria  now  permit  tautology  elimination  (both 
before  and  after  the  second  complementary  B-literal  has  been 
extended  upon)  , an  operation  not  completeness  preserving  for  ME 
(lacking  the  factorization  operation)  . 

1.  -iFya  -iFyw  -*Fwy 

2.  Fya  Fyg(y) 
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3. 

Fya 

Fg  (y)  y 

4. 

Fa  a 

(Fg  (a)  a] 

-•Fq(a)w  -iFwg(a) 

extend 

by 

1 (a) 

5. 

Fa  a 

IFg(a)al 

-«Fg{a)a  (-*Fag(a)]  Faa 

extend 

by 

2(b) 

6. 

Fa  a 

(Fg(a)  al 

-•Fg  (a)  a 

factor 

(a, 

rC) 

7. 

Fa  a 

reduce 

(b, 

-c) 

8. 

(Fac 

i]  -Fsa  - 

Faa 

extend 

by 

1(b) 

9. 

iFaa]  -Faa 

factor 

(b, 

-c) 

10. 

0 

reduce 

2.6.9.  Model  elimination  with  par amodulation.  Two  additional 
operations  are  available  for  handling  the  equality  relation 
specially  in  a fashion  similar  to  par amodulation. 

2^28.  A model  elimination  with  pa£§modulation 
^§^li££ioD  chain  K from  set  of  clauses  S is  a sequence 
of  permissable  chains 

(1)  Kq  is  the  top  chain  derived  from  a member  of  S, 

(2)  is  K,  and 

(3)  (l<i<^n)  is  derived  from  by  extension  by  a member  of 

S,  by  reduction,  by  factorization,  by  B^extension  by  a member  of 
S,  or  by  ^“reduction. 

A ME^P  xefutaiiQD  is  a ME-P  deduction  of  the  empty  chain  0. 

2.6.10.  Permissability.  A permissable  chain  is  a chain  in 
which 

(1)  no  A-literal  is  followed  by  a complementary  A-literal, 

(2)  no  B-literal  is  followed  by  an  identical  A-literal,  and 

(3)  the  last  literal  of  the  chain  is  not  an  A-literal.  A chain 
is  termed  pregermissable  if  it  satisfies  requirements  (1)  and 
(2)  . 
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2.6.11.  Extension.  The  MC-P  extension  opecation  is  the  same 
as  for  ME  except  it  operates  on  and  produces  permissable  rather 
than  admissable  chains. 

2.6.12.  Reduction.  The  ME-P  reduction  operation  is  the  same 
as  for  ME  except  it  operates  on  and  produces  permissable  rather 
than  admissable  chains. 

2.6.13.  Factorization.  The  ME-P  factorization  operation  is 
the  same  as  for  strong  ME  except  it  operates  on  and  produces 
permissable  rather  than  acceptable  chains. 

2.6.14.  P-extension.  The  p-extension  operation  takes  a 
permissable  chain  K and  an  input  clause  C as  its  arguments.  Let 
K*  and  C be  variable  disjoint  variants  of  K and  C. 

(a)  If  a literal  of  C is  of  the  form  a=b  or  b=a  where  a is 
unifiable  with  a term  in  the  last  literal  of  K',  the  chain 
consisting  of  K*  followed  by  the  remaining  literals  of  C 
followed  by  a copy  of  the  last  literal  of  K*  with  a single 
instance  of  a replaced  by  b with  unifying  substitution  applied  is 
formed.  Each  literal  of  the  derived  chain  descended  from  a 
literal  of  K*  is  designated  to  be  the  same  type  of  literal  as  its 
ancestor  except  the  last  which  is  designated  to  be  an  A-literal; 
each  literal  of  the  derived  chain  descended  from  a literal  of  C 
is  designated  to  be  a B-literal;  the  last  literal  of  the  derived 
chain  (in  which  an  instance  of  a term  matching  a was  replaced  by 
b)  is  designated  to  be  a B-literal.  This  form  of  p-extension  is 
called  p-extension  from  an  input  clause. 

(b)  If  the  last  literal  of  K*  is  a«b  or  b>a  where  a is  unifiable 
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with  a term  in  a literal  of  C,  the  chqin  consisting  of  K' 
followed  by  the  remaining  literals  of  c*  followed  by  a copy  of 
the  literal  containing  the  term  unified  with  a with  a single 
instance  of  that  term  replaced  by  b with  unifying  substitution 
applied  is  formed.  Each  literal  of  the  derived  chain  descended 
from  a literal  of  K’  is  designated  to  be  the  same  type  of  literal 
as  its  ancestor  except  the  last  which  is  designated  to  be  an 
A-literal;  each  literal  of  the  derived  chain  descended  from  a 
literal  of  C is  designated  to  be  a B-literal;  the  last  literal 
of  the  derived  chain  (in  which  an  instance  of  a term  matching  a 
was  replaced  by  b)  is  designated  to  be  a B-literal.  This  form  of 
p-extension  is  called  p-extension  to  an  input  clause. 

This  can  be  expressed  schematically  as 


(a) 

input  chain  a C 
inpih:  clause  v . . . v 
where  is  a=b  or  b=a 


derived  chain  a [C]  ... 

where  C is  C with  a single  a rep 


by  b. 


(b) 

input  chain  a C 
input  clause  ''  . . . v 
where  C is  a=b  or  b=a 


derived  chain  a (C)  ...  ...  Cj[ 

where  C{  is  Cf  with  a single  a replaced  by  b. 

2.6.15.  P-reduction.  The  p-reduction  operation  takes  an 

acceptable  chain  K as  its  argument. 

(a)  If  the  last  literal  of  K contains  a term  unifiable  with  the 
term  a where  a preceding  A-literal  of  K is  of  the  form  a=b  or 
b*a,  the  chain  consisting  of  K followed  by  a copy  of  the  last 
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literal  with  a single  instance  of  the  term  matching  a replaced  by 
b with  unifying  substitution  applied  is  formed.  Each  literal  of 
the  derived  chain  descended  from  a literal  of  K is  Uesignated  to 
be  the  same  type  of  literal  as  its  ancestor  except  the  last  which 
is  designated  to  be  an  A-literal;  the  last  literal  of  the  derived 
chain  (in  which  an  instance  of  a term  matching  a was  replaced  by 
b)  is  designated  to  be  a B-literal.  This  form  of  p-reduction  is 
called  p-reduction  from  an  A-literal. 

(b)  If  the  last  literal  of  K is  of  the  form  a=b  or  b*a  where  a is 
unifiable  with  a term  in  a preceding  A-literal  of  K or  the  last 
literal  of  K itself,  the  chain  consisting  of  K followed  by  a copy 
of  the  preceding  A-literal  or  last  literal  with  a single  instance 
of  the  term  matching  a replaced  by  b with  unifying  substitution 
applied  is  formed.  Each  literal  of  the  derived  chain  descended 
from  a literal  of  K is  designated  to  be  the  same  type  of  literal 
as  its  ancestor  except  the  last  which  is  designated  to  be  an 
A-literal;  the  last  literal  of  the  derived  chain  (in  which  an 
instance  of  a term  matching  a was  replaced  by  b)  is  designated  to 
be  a B-literal.  This  form  of  p-reduction  is  called  p-reduction 
to  an  A-literal  or  self. 

This  can  be  expressed  schematically  as 
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(a) 

input  chain  ^ (DJ  g C 
where  D is  a=b  or  b=a 


derived  chain  a ID]  b IC]  C 
where  C'  is  C with  a single  a replaced  by  b. 

(b) 

input  chain  a ICJ  b D 
where  D is  a=b  or  b=a 


derived  chain  a [C]  b IDJ  C 
where  C ' is  C with  a single  a replaced  by  b. 

C and  D may  be  the  same  literal  in  p- reduction  to  self. 


2.7.  MESON  theorem  proving  procedure 

The  MESON  procedures  were  first  reported  on  in  [32].  These 
procedures  are  complete  extensions  motivated  by  the  model 
elimination  theorem  proving  procedure  of  the  problem  reduction 
method  operating  on  AND/OR  goal  trees  (MESON  stands  for  "model 
elimination  subgoal  priented) . Here,  they  are  presented  for 
propositional  (variable  free)  problems  only  with  an  occasional 
reference  to  the  requirements  of  the  procedures  utilizing 
variables . 

The  procedures  represent  syntactic  systems  for  adding  to  a 
goal  tree  information  about  goal-subgoal  relationships  and 
establishment  of  goals.  The  procedures  return  success  or  failyifi 
according  to  whether  the  top  goal  can  be  established  or  not 
respectively.  Of  course,  the  ability  to  return  failure 
disappears  when  substitution  is  allowed,  e.g.,  first  order 
formulations.  A returned  value  of  failure  for  a problem 
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indicates  either  the  top  (]oal  docs  not  follow  from  the  assertions 
or  the  search  ordering  and  goal  generation  and  deletion 
strategies  specified  by  the  planning  routine  are  inadequate  for 
the  problem.  (It  is  possible  to  write  a complete  planning 
routine  which  theoretically  always  returns  success  for  solvable 
problems. ) 

We  will  now  present  two  MESON  procedures  for  goal  tree 
analysis  incorporating  the  new  rules  discussed  above.  The 
procedures  are  logically  divided  into  four  subprocedures  with 
labels  "initialize",  "loop",  ■update_marks" , and  "update_goals" . 

The  instructions  placed  at  the  label  "initialize"  define  GOALS 
(the  set  of  goals  to  be  attempted)  to  be  the  set  consisting  of 
only  the  top  goal  and  also  initialize  the  goal  tree. 

The  instructions  placed  at  the  label  "loop"  select  a goal  G 
from  GOALS,  an  operation  to  be  performed,  and  an  assertion  D if 
needed.  The  selected  operation  is  then  performed  for  the  goal  G 
and  assertion  D.  Those  operations  try  to  establish  goals  or 
create  subgoals. 

The  instructions  placed  at  the  label  "update_mar)cs"  add  newly 
generated  subgoals  to  the  tree  and  GOALS  provided  certain 
acceptance  criteria  are  met. 

The  selection  of  the  next  goal  in  GOALS  to  be  operated  upon 
and  the  selection  of  the  operation  and  the  assertion  to  be  used 
in  operating  on  that  goal  are  assumed  to  be  accomplished  by  some 
externally  specified  planning  routine  ("the  planner").  The 
planner,  in  addition  to  specifying  a search  strategy,  may 


restrict  or  totally  eliminate  use  of  some  operations.  For 
example^  traditional  goal  tree  procedures  without  a reasoning  by 
contradiction  mechanism  correspond  to  a planner  which  never  uses 
the  operation  at  "op3". 

The  planner,  by  applying  the  operation  at  "op5"  to  a goal, 
removes  the  goal  from  GOALS  and  thereby  signifies  that  no  more 
operations  will  be  applied  to  the  goal. 

If  one  wishes  to  insure  completeness,  the  planner  must  in  some 
order  process  all  operations  (except  the  operation  at  "op5")  for 
each  goal  and  potentially  applicable  assertion.  The  planner  may 
select  the  goals  of  a conjunctive  set  of  goals  in  any  desired 
order  to  attempt  their  establishment.  The  procedures  make  no 
assumption  as  to  whether  the  search  is  depth-first, 
breadth-first,  or  some  mixture  of  these. 

2.7.1.  MESON  procedure. 

initialize:  Let  GOALS  be  a set  consisting  of  only  the  top  goal. 

Initialize  the  goal  tree  to  the  top  goal. 

loop:  If  GOALS  is  empty,  exit  procedure  with  failure. 

Otherwise,  let  G be  a goal  in  GOALS  selected~by  the 
planner.  The  planner  selects  one  of  the  following 
operations  to  be  performed  on  G and  selects  0,  a premise, 
implication,  or  general  contrapositive  of  implication,  as 
required  by  the  operation. 


opl:  If  G matches  the  premise  D,  mark  G established  and 

go  to  update_marks . Otherwise  go  to”IoopT 

op2:  If  G matches  the  consequent  of  0,  where  D is  an 

implication  or  general  contrapositive  of 
implication,  let  A be  the  set  of  antecedents  of  D 
and  go  to  update^goals.  Otherwise  go  to  loop. 

op3:  If  G matches  the  complement  of  an  ancestor  of  G, 
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mark  G contradicted  and  go  to  update  marks. 
Otherwise  go  to  loop. 

op4;  If  G matches  an  unexpanded  partner  of  G not  marked 

or  an  unexpanded  ancestor  partner  of  G, 
mark  G oisglaced  and  go  to  update  marks.  Otherwise 
go  to  loop.  ~ 


op5:  Delete  G from  GOALS  and  go  to  loop. 


update^marks;  If  G is  top  goal,  exit  procedure  with  success.  if 
all  partner  goals  of  G are  marked  established, 
SQntradigted,  or  disglaced,  let  Gj^  be  the  parent  of  G,  set 
G»G2,  mark  G established  and  go  to  update__marks . 

Otherwise  go  to'IoopT 

update_goals: 

testl;  If  a member  of  A is  identical  to  G or  an  ancestor 
of  G,  go  to  loop. 

test2:  If  a member  of  A is  complementary  to  another 

member  of  A,  an  unexpanded  partner  of  G,  or  an 
unexpanded  ancestor  partner  of  G,  go  to  loop. 


Otherwise  add  the  member  of  A to  GOALS  and  to  the  goal 
tree  as  a conjunctive  set  of  successors  of  G and  go  to 
loop. 

2.7.2.  MESON  procedure  for  equality.  The  MESON  procedure  for 
equality  incorporates  rules  for  handling  the  equality  relation. 

It  differs  from  the  MESON  procedure  in  that  three  new  operations 
are  added.  Also,  the  rules  for  disregarding  newly  generated 
subgoals  (at  "testl"  and  "test2")  have  not  been  proven  to 
preserve  completeness  although  we  believe  completeness  is 
preserved  with  these  rules  applied.  We  maintain  the  update^goals 
subprocedure  in  the  MESON  procedure  in  the  MESON  procedure  for 
equality  with  the  admonition  that  if  completeness  is  to  be 
preserved  these  rules  should  be  bypassed  (at  present) . 

For  technical  reasons,  it  is  necessary  to  put  in  premises  of 
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the  form  a=a  -for  eacn  term  a or,  i£  in  a setting  using  -free 
variables  and  substitutions,  one  must  put  in  and 

f , . . . ,Xyj)  «.f  ( xj  , . . . ,Xn)  'for  each  n-ary  function  symbol  ■£.  Such 
axioms  can  be  replaced  by  appropriate  procedure  rules  if  desired. 


MESON  pcQcsdjJLfi  -iQi:  egudllt^ 
initialize:  (same  as  -for  MESON  procedure) 

loop:  (preface  and  operations  1-5  same  as  for  MESON  procedure; 

only  change  is  the  addition  of  the  -following  operations) 


op6:  If  G contains  a term  matching  term  a where  a=b  or  b*a 
is  the  consequent  of  D,  where  D is  a premise, 
implication,  or  general  contrapositive  of 
implication,  let  A oe  the  set  consisting  of  G with  a 
single  instance  of  a replaced  by  b plus  the 
antecedents  of  D and  go  to  update__goals.  Otherwise 
go  to  loop. 

op7:  If  the  consequent  of  0,  where  0 is  a premise, 

implication,  or  general  contrapositive  of 
implication,  contains  a term  matching  a where  G is 
a/b  or  b^a,  let  A be  the  set  consisting  of  the 
complemented  consequent  of  D with  a single  instance 
of  a replaced  by  b plus  the  antecedents  of  D and  go 
to  update_goals.  Otherwise  go  to  loop. 

op8:  If  H is  an  ancestor  of  G or  G itself  and  H (resp.  G) 

contains  a term  matching  a where  G (resp.  H)  is  a/b 
or  b/a,  let  A be  the  set  consisting  of  H (resp.  G) 
with  a single  instance  of  a replaced  by  b and  go  to 
update_goals.  Otherwise  go  to  loop. 

update^mar ks:  (same  as  -for  MESON  procedure) 

update  goals:  (same  as  for  MESON  procedure) 


2.8.  The  'Linear  MESON  theorem  proving  procedure 

The  linear  MESON  procedure  is  a variant  of  the  model 
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elimination  theorem  proving  procedure  in  which  (1)  each  literal 
of  the  top  chain  and  the  derived  chains  of  a deduction  is 
replaced  by  its  complement  and  (2)  implications  as  well  as 


disjunctions  are  permitted  as  axioms.  It  is  also  the  linear  form 
of  the  MESON  procedure  specified  for  goal-subgoal  trees. 
Advantages  of  the  linear  MESON  procedure  are  (1)  the  linear  MESON 
procedure,  though  logically  equivalent  to  model  elimination,  has 
the  form  of  an  affirmation  rather  than  a refutation  procedure  and 
its  proofs  have  a very  natural  interpretation  in  terms  of 
goal-subgoal  trees  and  (2)  the  linear  MESON  procedure  has  greater 
expressive  power  than  alternative  procedures  in  the  potential 
use  of  implications  rather  than  disjunctions  to  restrict 
application  of  inference  operations  to  certain  literals  of  axioms 
or  (by  replacing  a disjunction  by  more  than  one  implication)  to 
facilitate  differential  treatment  of  the  various  implicative 
forms  of  axioms  during  the  search  for  a solution  (this  capability 
is  shared  by  model  elimination  in  which  each  length  n input 
clause  generates  n auxiliary  chains  only  the  last  literal  of 
which  can  be  matched  in  inference  operations) . 

In  a theorem  proving  program  permitting  interaction  between 
human  user  and  mechanical  proof  procedure,  it  is  desirable  that 
as  human-oriented  a procedure  as  possible  be  employed.  While  it 
is  correctly  argued  that  all  resolution  type  theorem  proving 
procedures  are  machine-oriented  and  unsuited  to  extensive  human 
computation,  it  is  our  contention  that  the  linear  MESON  procedure 
is  more  human-oriented  than  other  resolution  related  procedures. 
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This  is  a direct  consequence  of  the  relationship  of  the  linear 
MESON  procedure  with  the  problem- reduction  method.  The  linear 
MESON  procedure  is  an  extension  of  the  problem-reduction  method 
which  is  complete  for  the  first  order  predicate  calculus  with 
equality.  It  augments  the  problem-reduction  method  by  inference 
operations  which  perform  reasoning  by  contradiction  (reduction), 
which  eliminate  duplicate  subgoals  (factorization) , and  which 
deal  with  the  equality  relation  (p-extension,  p-reduction) . 

The  linear  MESON  procedure  represents  the  state  of  a search 
for  solution  as  a set  of  chains.  Each  chain  represents  a subtree 
of  the  search  space.  The  solution  of  all  the  subgoals 
represented  in  a chain  constitutes  a solution  of  the  top  goal. 
Different  chains  represent  different  alternative  partial 
solutions  of  the  top  goal. 

More  specific  claims  of  the  linear  MESON  procedure  being 
human-oriented  are  (1)  it  is  an  affirmation  rather  than  a 
refutation  procedure,  (2)  in  keeping  literals  ordered  in  a chain, 
it  automatically  prevents  (in  goal-subgoal  tree  terms)  the  start 
of  an  attempt  to  solve  another  subgoal  in  the  chain  until  the 
current  one  is  solved,  and  (3)  it  is  a procedure  which  remains 
complete  if  only  input  deductions  are  used.  An  input  deduction 
is  a deduction  in  which  each  element  of  the  deduction  (a  linear 
MESON  procedure  chain)  is  derived  by  an  inference  operation 
applied  to  its  predecessor  or  its  predecessor  and  an  (input) 
axiom. 

In  combination,  the  last  two  items  permit  the  user  to  focus 
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his  attention  on  a much  smaller  subset  of  the  available  data  than 
is  possible  for  many  resolution  based  procedures. 

In  preparing  a problem  for  input  to  the  linear  MESON 
procedure,  it  is  first  Skolemized  to  eliminate  all  quantifiers. 
This  can  be  accomplished  by  the  following  procedure. 

Let  A be  a formula  using  only  the  y,  and  -•  logical 
connectives.  Occurrences  of  can,  of  course,  be  replaced  by 
B«C  A C«B  or  BaC  V -iBa-^.  etc.,  to  so  restrict  connective  usage. 

Let  A be  further  restricted  such  that  no  quantifier  quantifies 
a free  variable  of  A or  any  variable  already  quantified  at  that 
point  i.e..  the  scopes  of  variables  with  the  same  name  are 
disjoint.  A renaming  of  variables  can  guarantee  this  condition. 

A positive  occurrence  of  a quantifier  in  A is  a quantifier  in 
the  scope  of  an  even  number  (possibly  zero)  of  negations,  either 
explicit  (in  the  case  of  the  -•  connective)  or  implicit  (in  the 
case  of  occurring  in  an  antecedent  of  an  implication) . 

The  formula  is  Skolemized  by  (1)  replacing  each  variable 
quantified  by  a positively  occurring  universal  quantifier  or  a 
negatively  occurring  existential  quantifier  by  a unique  Skolem 
function  whose  arguments  are  the  variables  of  all  positively 
occurring  existential  quantifiers  or  negatively  occurring 
universal  quantifiers  in  whose  scope  the  quantifier  of  the 
variable  being  replaced  appears  and  (2)  deleting  all  quantifiers 
from  the  formula. 

The  resulting  quantifier  free  formula  is  then  converted  to  a 
conjunction  of  assertions  implying  a ccDClusiSQ*  An  assertion  is 
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a (possibly  empty)  conjunction  of  literals  (antecedents)  implying 
a disjunction  of  literals  (conseguents) . Note  that  if  there  are 
no  antecedents  in  the  assertion,  the  assertion  is  just  a 
disjunction  of  literals  (disguncts) , i.e.,  a clause.  A 
conclusion  is  a conjunction  of  literals.  Schematically,  the 
transformed  formula  is  in  the  form 

( (A  Ja  . . . aa1^*c}v  . . . vcj^ ) A . . . A ( Af  A . . . . . . vCgp)  ♦ (Gi^  . . . AGq)  ) 


where  A,  C and  G denote  literals,  and  p^l , q^l,  each  m]^>0,  each 


nj^l. 


Examglg  24.12*  The  problem  is:  if  a is  a prime  number  and  a 
times  the  square  of  some  number  u is  b then  a divides  b.  The 
initial  formula  is: 

VxVyVzVw  (prime (x)  a y*z»w  a divides (x,w) 

♦ divides(x,y)  v di vides (x, z) ) 

A Vx  x*x»square (x) 

A VxVyVz  (x*y*z  ♦ y*x»z) 

A VxVyVz  (x*y*z  ♦ divides (x, z) ) 

A 9u  a*square (u) ^square (b) 

A prime (a) 

* divides(a,b) 

The  Skolemized  form  ready  for  input  to  the  linear  MESON  procedure 


is: 


(prime(x)  a y*z«w  a divides(x,w)  ♦ divides(x,y)  v divides (x, z) ) 
A x*x>square (x) 

A (x*y»z  y*x»z) 

A (x*y*z  ♦ divides (x, z) ) 

A a*square (c) »square (b) 

A prime(a) 

•»  divides(a,b) 

Problems  with  equality  (as  above)  can  be  introduced  without 
the  need  for  specifying  the  symmetry,  transitivity,  and 
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substitutivity  axioms  if  the  special  equality  inference 
operations  (p-extension , p-reduction)  are  used.  The  equality 


reflexive  (x=x)  and  functionally  reflexive  axioms  (e.g., 
square ( x) ^square (x) , x*y=x*y)  theoretically  are  required.  The 
latter  are  not  present  in  the  above  example  since  no  special 
equality  rules  are  required  for  the  problem's  solution. 

By  virtue  of  its  derivation  from  model  elimination,  the  linear 
MESON  procedure  is  complete  (given  that  a compatible  set  of 
inference  and  postprocessing  operations  are  used)  provided  (1) 
the  set  of  assertions  is  consistent  (this  requirement  is 
equivalent  to  the  requirement  that  the  top  chain  of  a model 
elimination  deduction  be  in  the  minimally  unsati sfiable  set  of 
input  clauses)  and  (2)  either  the  disjunctive  axiom  form  is  used 
or  ed.1  implicative  forms  of  each  assertion  are  included  among  the 
assertions.  The  first  condition  can  be  eliminated  by  the 
addition  of  a special  contradiction  mechanism  defined  for  the 
MESON  procedure  but  not  included  in  this  formulation  of  the 
linear  MESON  procedure  which  permits  the  proof  of  any  conclusion 
from  an  inconsistent  set  of  assertions.  The  second  condition  can 
sometimes  be  eliminated  in  practice  since  it  is  often  clear  from 
the  problem  structure  (as  in  the  case  of  Horn  formulas)  that  use 
of  a subset  of  the  implicative  forms  results  in  the  possible 
deduction  of  all  the  chains  that  the  disjunctive  form  would. 

Also,  although  the  resulting  procedure  is,  in  general,  incomplete 
it  is  sometimes  desirable  to  restrict  the  search  for  a proof  by  I 

not  presenting  the  procedure  with  all  the  implicative  forms  of 
the  assertions. 
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The  fundamental  element  of  a linear  MESOii  procedure  deduction 
is  the  chain,  a chain  is  an  ordered  sequence  of  literals.  Two 
types  of  literals  are  distinguished:  A^literals  and  BrliteialS. 
B-literals  correspond  to  the  literals  present  in  clauses  in 
resolution  theorem  proving.  A-literals  record  ancestry 
information  and  represent  (in  goal-subgoal  tree  terms)  higher 
goals.  All  the  literals  in  the  theorem  and  axioms  are 
B-literals.  An  A-literal  is  created  in  a newly  derived  chain 
from  a B-literal  in  the  parent  chain  when  a set  of  literals  whose 
conjunction  implies  the  A-literal  (a  set  of  subgoals  whose 
solution  constitutes  a solution  to  the  goal  represented  by  the 
A-literal)  is  added. 


5SfiDi£i2D  • A liQsat  meson  procedure  of  chain 

K from  problem  P is  a sequence  Kq,...,Kj,  of  asgegtable  chains 
such  that 

(1)  Kq  is  the  conclusion  of  P, 

(2)  is  K, 

(3)  (l£i£n)  is  derived  from  by  extension  (by  an  assertion 

Ci)  » i§££or iaatign,  Reduction,  grextension  (by  an  assertion  C^) 
or  P:IS^i)££iSQ>  where  is  an  assertion  of  P or  a lemma  (see 
lemma  formation  operation  below) . 

^ fi£2£§^2£§  §2ii}£i2I}  ^ ££22l  of  the 

conclusion  of  P)  is  a linear  MESON  procedure  deduction  of  the 
empty  chain  0.  In  general,  for  the  inference  system  to  be 
complete,  the  negation  of  the  conclusion  must  be  included  among 
the  assertions.  The  definitions  of  acceptable  chains  and  the 
inference  operations  are  given  below. 
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2.8.1.  Matching.  If  the  two  arguments  to  the  matching 
procedure  are  terms,  the  matching  procedure  returns  the  most 
general  unifier  of  the  terms.  If  the  two  terms  are  not 
unifiable,  the  matching  procedure  fails. 

If  the  two  arguments  to  the  matching  procedure  are  literals 
and  are  both  positive  (unnegated)  or  both  negative  (negated) 
literals  with  unifiable  atomic  formulas,  the  matching  procedure 
returns  the  most  general  unifier  of  the  atomic  formulas.  If  the 
two  arguments  are  not  both  positive  or  both  negative  or  the 
atomic  formulas  are  not  unifiable,  the  matching  procedure  fails. 

2.8.2.  Extension.  The  extension  operation  takes  an  acceptable 
chain  K and  an  assertion  C as  its  arguments.  Let  K'  and  C be 
variable  disjoint  variants  of  K and  C.  If  the  last  literal  of  K' 
matches  a consequent  of  C,  the  chain  consisting  of  K'  followed 
by  the  antecedents  and  the  complements  of  the  remaining 
consequents  of  C with  matching  substitution  applied  can  be 
inferred.  Each  literal  of  the  derived  chain  descended  from  a 
literal  of  K*  is  designated  to  be  the  same  type  of  literal  as  its 
ancestor  except  the  last  which  is  designated  to  be  an  A-literal; 
each  literal  of  the  derived  chain  descended  from  a literal  of  C 
is  designated  to  be  a B-literal. 

This  can  be  expressed  schematically  as 

input  chain  a 

input  assertion  Aj  . . . a v • • • v 

derived  chain  a ICj^J  A^  ...  A„  -C^  ...  "^i+l  •••  ~<^n* 
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2.8.3.  Factorization.  The  factorization  operation  takes  an 
acceptable  chain  K as  its  argument.  If  the  last  literal  of  K 
matches  a preceding  B-literal  of  K,  the  chain  consisting  of  K 
with  the  last  literal  removed  and  with  matching  substitution 
applied  can  be  inferred.  Each  literal  of  the  derived  chain  is 
designated  to  be  the  same  type  as  its  ancestor. 

This  can  be  expressed  schematically  as 

input  chain  a C e C 
derived  chain  a C 8. 

2.8.4.  Reduction.  The  reduction  operation  takes  an  acceptable 
chain  K as  its  argument.  If  the  last  literal  of  K matches  the 
complement  of  a preceding  A-literal  of  K,  the  chain  consisting  of 
K with  the  last  literal  removed  and  with  matching  substitution 
applied  can  be  inferred.  Each  literal  of  the  derived  chain  is 
designated  to  be  the  same  type  as  its  ancestor. 

This  can  be  expressed  schematically  as 

input  chain  o [C]  8 ~C 
derived  chain  a [C]  8. 

2.8.5.  P-extension.  The  p-extension  ("p-"  for  par amodulation) 
operation  takes  an  acceptable  chain  K and  an  assertion  C as  its 
arguments.  Let  K*  and  C be  variable  disjoint  variants  of  K and 
C.  (a)  If  a consequent  of  C is  of  the  form  a=b  or  b=a  where  a 
matches  a term  in  the  last  literal  of  K',  the  chain  consisting  of 
K*  followed  by  the  antecedents  and  the  complements  of  the 
remaining  consequents  of  C followed  by  a copy  of  the  last 
literal  of  K'  with  a single  instance  of  the  term  matching  a 
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replaced  by  b with  matching  substitution  applied  can  be  inferred. 
Each  literal  of  the  derived  chain  descended  from  a literal  of  K' 
is  designated  to  be  the  same  type  of  literal  as  its  ancestor 
except  the  last  which  is  designated  to  be  an  A-literal;  each 
literal  of  the  derived  chain  descended  from  a literal  of  C ' is 
designated  to  be  a B-literal;  the  last  literal  of  the  derived 
chain  (in  which  an  instance  of  a term  matching  a was  replaced  by 
b)  is  designated  to  be  a B-literal.  This  form  of  p-extension  is 

called  p-extension  from  an  assertion.  (b)  If  the  last  literal  of 
K*  is  of  the  form  a/b  or  b^a  where  a matches  a terra  in  a 
consequent  of  C,  the  chain  consisting  of  K'  followed  by  the 
antecedents  and  the  complements  of  the  remaining  consequents  of 
C followed  by  a copy  of  the  complement  of  the  consequent 
containing  the  term  matching  a with  a single  instance  of  that 
term  replaced  by  b with  matching  substitution  applied  can  be 
inferred.  Each  literal  of  the  derived  chain  descended  from  a 
J-iteral  of  K*  is  designated  to  be  the  same  type  of  literal  as  its 
ancestor  except  the  last  which  is  designated  to  be  an  A-literal; 
each  literal  of  the  derived  chain  descended  from  a literal  of  C 
is  designated  to  be  a B-literal;  the  last  literal  of  the  derived 
chain  (in  which  an  instance  of  a term  matching  a was  replaced  by 
b)  is  designated  to  be  a B-literal.  This  form  of  p-extension  is 
called  p-extension  to  an  assertion. 

This  can  be  expressed  schematically  as 
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-Cn  C- 


(a) 

input  chain  a C 

assertion  a . . . a v . . . v 

where  is  a=b  or  b=a 


derived 


chain 

where 


a IC]  A 


IS 


1 • • • 

C with 


-c- 


a single  a 


''Ci-1  + l •• 

replaced  by  b. 


(b) 

input  chain  a C 

assertion  a . . . a a ♦ C,  ^ . v 

where  C is  a=b  or  5=a 

derived  chain  a [C]  Aj^  ...  A^,  -Cj  ...  ~Ci_i  ~C^  + n ...  -Cn-Ci 
where  is  with  a single  a replacea  by  b. 

2.8.6.  P-reduction.  The  p-reduction  ("p-"  for  par amodulation) 
operation  takes  an  acceptable  chain  K as  its  argument.  (a)  If 
the  last  literal  of  K contains  a term  matching  the  term  a where  a 
preceding  A-literal  of  K is  of  the  form  a/b  or  b/a,  the  chain 
consisting  of  K followed  by  a copy  of  the  last  literal  with  a 
single  instance  of  the  term  matching  a replaced  by  b with 
matching  substitution  applied  can  be  inferred.  Each  literal  of 
the  derived  chain  descended  from  a literal  of  K is  designated  to 
be  the  same  type  of  literal  as  its  ancestor  except  the  last  which 
is  designated  to  be  an  A-literal;  the  last  literal  of  the  derived 
chain  (in  which  an  instance  of  a term  matching  a was  replaced  by 
b)  is  designated  to  be  a B-literal.  This  form  of  p-reduction  is 
called  p-reduction  from  an  A-literal.  (b)  If  the  last  literal  of 
K is  of  the  form  a^b  or  b/a  where  a matches  a term  in  a preceding 
A-literal  of  K or  the  last  literal  of  K itself,  the  chain 
consisting  of  K followed  by  a copy  of  the  preceding  A-literal  or 
last  literal  with  a single  instance  of  the  term  matching  a 
replaced  by  b with  matching  substitution  applied  can  be  inferred. 
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Each  literal  of  the  derived  chain  descended  from  a literal  of  K 
is  designated  to  be  the  same  type  of  literal  as  its  ancestor 
except  the  last  which  is  designated  to  be  an  A-literal;  the  last 
literal  of  the  derived  chain  (in  which  an  instance  of  a term 
matching  a was  replaced  by  b)  is  designated  to  be  a B-literal. 
This  form  of  p-reduction  is  called  p-reduction  to  an  A-literal  or 
self. 

This  can  be  expressed  schematically  as 

(a) 

input  chain  q [D]  b C 
where  D is  a»b  or  b^a 

derived  chain  a [D]  8 [CJ  C* 
where  C is  C with  a single  a replaced  by  b. 

(b) 

input  chain  a [C]  6 0 
where  D is  a=b  or  b*a 

derived  chain  a [C]  8 (0]  C* 
where  C is  C with  a single  a replaced  by  b. 

C and  D may  be  the  same  literal  in  p-reduction  to  self. 

If  the  parent  chain  K in  the  p-extension  or  p-reduction 
operation  is  itself  derived  by  p-extension  or  p-reduction,  the 
created  A-literal  in  the  derived  chain  may  optionally  be  omitted 
with  completeness  unaffected  (see  [31]).  There  is  a tradeoff 
here.  If  the  A-literal  is  omitted,  the  derived  chain  is  shorter 
and  easier  to  read,  and  some  future  possible  reductions  and 
p- reductions  may  be  eliminated.  On  the  other  hand,  especially  if 
the  postprocessing  operation  specifies  rejection  of  chains 
containing  an  A-literal  followed  by  an  identical  A-literal  or 
■-literal,  retention  of  the  A-literal  may  result  in  rejection  of 


73 


more  chains  as  being  unacceptable.  For  example,  this  could 
prevent  repeated  p-extension  by  a=b  from  creating  an  endless 
sequence  of  chains  ending  alternately  in  Pa  or  Pb. 

2.8.7.  Postprocessing.  A postprocessing  operation  takes  a 
chain  K (output  from  the  extension,  factorization,  reduction, 
p-extension  or  p-reduction  operation)  as  its  argument  and  either 
rejects  K as  being  non- acceptable  and  thus  unusable  as  input  to 
any  inference  operation  or  transforms  the  chain  into  an 
acceptable  chain.  Many  different  postprocessing  operations  can 
be  written  with  different  effects  regarding  efficiency  and 
completeness.  Four  postprocessing  operations  are  described  in 
the  following  table.  The  table  expresses  possible  relationships 
between  each  pair  of  literals  in  the  chain.  All  the  actions 
corresponding  to  true  conditions  are  to  be  performed  on  the 
chain,  except,  of  course,  that  if  the  action  is  to  reject  the 
chain  then  no  other  conditions  need  be  checked  or  actions  need  be 


performed . 


F 


Postprocessing 

operations 

STRONG- 

SAVE 

STRONG- 

DELETE 

WEAK- 

SAVE 

V^EAK- 

DELETE 

Condition 

Action 

A-litefal  followed  by 
identical  A-literal 

r'ej  ect 
chain 

i ej  ect 
cha  in 

reject 

chain 

rej  ect 
chain 

A-literal  followed  by 
complementary  A-literal 

reject 

chain 

reject 

chain 

rej  ect 
chain 

rej  ect 
chain 

A-literal  followed  by 
identical  B-literal 

rej  ect 
chain 

reject 

chain 

reject 

chain 

reject 

chain 

A-literal  followed  by 
complementary  B-literal 

delete 

following 

B-literal 

delete 

following 

B-literal 

B-literal  followed  by 
identical  A-literal 

rej  ect 
chain 

reject 

chain 

B-literal  followed  by 
complementary  A-literal 

rej  ect 
chain 

rej  ect 
chain 

B-literal  followed  by 
identical  B-literal 

delete 

following 

B-literal 

delete 

following 

B-literal 

B-literal  followed  by 
complementary  B-literal 

rej  ect 
chain 

rej  ect 
chain 

If  all  the  actions  of 

the  postprocessing  operation  specified 

in  the  table  have  been  performed 

and  the  chain 

is  not  rejected. 

all  terminal  A-literals  of  the  chain  are  deleted.  This  terminal 


A-literal  deletion  is  called  contraction. 

The  deletion  action  associated  with  the  "B-literal  followed  by 
identical  B-literal"  condition  is  called  3£ound_factor ization 
since  it  represents  an  instantiation-free  usage  of  a generalized 
factorization  operation  which  can  delete  non-terminal  B-literals. 
Similarly,  the  deletion  action  associated  with  the  "A-literal 
followed  by  complementary  B-literal"  condition  is  called 
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3£°i^G^_£®^H2^i2D  since  it  represents  an  instantiation-iree  usage 
of  a generalized  reduction  operation  which  can  delete 
non-terminal  B-literals. 


Several  additional  conditions  and  actions  can  be  used  in 
postprocessing  operations  such  as  the  following  which  are 
available  in  PSTP  but  were  not  used  in  the  present  study:  (1) 
rejecting  chains  containing  a non- terminal  A-literal  that  is  an 
instance  of  a unit  axiom  (single  literal  input  assertion),  (2) 
rejecting  chains  containing  a non-terminal  A-literal  that  is  an 
instance  of  a unit  lemma  (single  literal  derived  assertion,  see 
lemma  formation  operation  below)  by  which  the  ancestor  chain 
could  have  been  extended  when  the  A-literal  was  created,  (3) 
deletion  of  B-literals  which  are  instances  of  unit  axioms,  (4) 
deletion  of  B-literals  which  are  instances  of  unit  lemmas,  (5) 
removal  of  all  literals  including  and  following  the  A-literal  in 
the  case  of  a B-literal  followed  by  an  identical  A-literal  (this 
is  called  factorization_truncation) , (6)  removal  of  all  literals 
including  and  following  the  second  A-literal  in  the  case  of  an 
A-literal  followed  by  a complementary  A-literal  (this  is  called 
reduction_truncation) . All  of  these  actions  can  be  shown  to 
preserve  completeness. 

For  problems  not  involving  the  equality  relation,  the 
WEAK-SAVE  and  WEAK-DELETE  postprocessing  operations  yield  a 
complete  inference  system  when  the  extension  and  reduction 
operations  are  used.  The  WEAK  postprocessing  operations 
correspond  closely  to  the  chain  admissabil ity  criteria  for  model 
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elimination  of  [29].  Also  for  problems  not  involving  the 
equality  relation r the  STRONG-SAVE  and  STRONG-DELETE 
postprocessing  operations  yield  a complete  inference  system 
provided  the  factorization  operation  is  used  in  addition  to 
extension  and  reduction.  The  STRONG  postprocessing  operations 
correspond  closely  to  the  chain  acceptability  criteria  for  strong 
model  elimination  of  [30].  Note  that  with  these  postprocessing 
operations,  if  the  conclusion  is  variable  free,  its  negation  need 
not  be  included  among  the  assertions  since  extension  by  the 
negation  of  the  conclusion  in  any  deduction  from  the  conclusion 
would  result  in  a non- acceptable  chain  with  an  A-literal  followed 
by  an  identical  B-literal  or  an  A-iiteral  followed  by  a 
complementary  A-literal. 

For  problems  involving  the  equality  relation,  a postprocessing 
operation  which  rejects  a chain  only  if  it  has  an  A-literal 
followed  by  a complementary  A-literal  yields  a complete  inference 
system  when  the  extension,  reduction,  p-extension  and  p-reduction 
operations  are  used.  Also  for  problems  involving  the  equality 
relation,  a postprocessing  operation  which  rejects  a chain  only 
if  it  has  a A-literal  followed  by  a complementary  A-literal  or  a 
B-literal  followed  by  an  identical  A-literal  yields  a complete 
inference  system  provided  the  factorization  operation  is  also 
used.  This  corresponds  closely  to  the  chain  permissability 
criteria  for  model  elimination  with  par amodulation  of  [31].  We 
believe  (but  have  no  proof)  that  for  problems  involving  the 
equality  relation,  the  WEAK-SAVE  and  WEAK-DELETE  postprocessing 
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operations  yield  a complete  inference  cystem  when  the  extension, 
reduction,  p-extension  and  p-reduction  operations  are  used  and 
the  GTRONG-SAVE  and  STRONG-DELETE  postprocessing  operations  yield 
a complete  inference  system  when  the  factorization  operation  is 
also  used.  The  equality  reflexive  (x=x)  and  functionally 
reflexive  axioms  (e.g.,  f ( x ,y) =f ( x ,y) ) theoretically  are  required 
in  any  case. 

2.8.8.  Lemma  formation.  An  additional  inference  operation  is 
used  to  create  new  assertions  during  contraction.  A new 
assertion  (called  a lemma)  consisting  of  the  disjunction  of  the 
terminal  A-literal  being  removed  and  all  preceding  A-literals 
whose  scope  (see  below)  exceeds  the  number  of  A-literals  between 
them  and  the  terminal  A-literal  and  the  complements  of  all 
preceding  B-literals  whose  scope  exceeds  the  number  of  A-literals 
between  them  and  the  terminal  A-literal  can  be  inferred. 

The  scope  of  each  literal  in  the  conclusion  is  0 and  the  scope 
of  each  literal  added  to  a chain  in  the  extension  and  p-extension 
operations  is  0.  In  the  factorization  and  reduction  operations 
(and  also  in  ground  factorization  and  reduction  performed  by  the 
postprocessing  operation) , the  scope  of  the  leftmost  involved 
literal  is  set  to  the  maximum  of  its  previous  scope  and  the 
number  of  A-literals  between  it  and  the  rightmost  involved 
literal.  In  the  p-reduction  operation,  the  scope  of  a literal 
descended  from  an  involved  A-literal  is  set  to  the  number  of 
following  A-literals  in  the  derived  chain.  Each  other  literal 
has  the  same  scope  as  its  parent  literal  in  the  parent  chain. 
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After  each  contraction  operation,  the  scope  of  each  literal  is 
set  to  the  minimum  of  its  previous  scope  and  the  number  of 
A-literals  following  it  in  the  chain,  i.e.,  no  literal  will  be 
allowed  to  have  a scope  which  exceeds  the  number  of  following 
A-literals. 

Lemma  formation  creates  assertions  from  solved  goals.  Removal 
of  an  A-literal  by  contraction  does  not  mean  that  the  goal  it 
represents  has  been  solved  globally,  but  only  that  it  has  been 
solved  in  the  environment  of  the  chain  of  which  it  was  a part. 
Joined  in  disjunction  with  the  A-literal  is  the  negation  of  each 
of  the  assumptions  from  the  chain  used  in  the  solution  of  the 
A-literal.  Thus,  the  resulting  lemma  states  either  the  "solved" 
A-literal  is  true  or  one  or  more  of  the  assumptions  was  false. 

The  assumptions  which  could  be  used  in  the  solution  of  the 
A-literal  are  the  negation  of  an  A-literal  (reduction, 
p-reduction)  or  a B-literal  (factorization).  The  scope  mechanism 
keeps  track  of  the  assumptions  made  with  respect  to  the  solution 
of  each  goal. 

Lemma  formation,  while  it  may  generate  useful  assertions 
during  the  search  for  a proof,  is  not  required  for  completeness. 

Several  heuristics  are  available  to  eliminate  the  generation 
of  redundant  lemmas  such  as:  (1)  the  first  lemma  to  be  generated 
after  an  extension  operation  followed  by  zero  or  more 
factorization  or  reduction  operations  is  always  redundant  and  (2) 
if  the  chain  has  two  or  more  terminal  A-literals  and  the  lemma 
associated  with  one  subsumes  the  lemma  associated  with  another. 
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the  second  need  not  be  generated  (specific  cases  of  this 
condition  can  readily  be  checked  by  examining  the  scopes  of  the 
literals  involved) . 

2.8.9.  Subsumption.  Redundant  chains  and  assertions  can  be 
eliminated  from  future  use  by  subsumption.  One  chain  is  subsumed 
by  another  and  can  thus  be  eliminated  if  an  instance  of  the 
latter  chain  is  an  initial  subsequence  of  the  former  (sequences 
of  B-literals  between  A-literals  may  be  freely  reordered  during 
the  subsumption  test).  One  assertion  subsumes  another  if  the 
chain  corresponding  to  the  former  assertion  subsumes  the  chain 
corresponding  to  the  latter  (the  corresponding  chain  is  formed  by 
making  a list  of  B-literals  being  the  consequents  and 
complemented  antecedents  of  implications  or  disjuncts  of 
disjunctions)  and,  in  the  case  where  the  subsuming  assertion  is 
an  implication,  no  disjunct  or  consequent  of  the  subsumed 
assertion  is  matched  to  an  antecedent  of  the  subsuming  assertion. 
The  latter  provision  prevents  the  subsumption  of  an  assertion  by 
another  implicative  form  of  the  same  assertion  both  of  which  may 
be  required  for  completeness.  Stronger  subsumption  rules  are 
possible  (see  [31]). 

With  relation  to  a search  strategy,  two  additional  classes  of 
subsumption  are  recognized:  (1) 

subsumption  of  a newly  created  chain  or  assertion  by  a previously 
available  chain  or  assertion  and  (2)  backward_subsumgtion  is  the 
subsumption  of  a previously  available  chain  or  assertion  by  a 
newly  generated  chain  or  assertion.  In  general,  completeness  is 
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'I 

assured  only  if,  when  backward  subsumption  is  used,  it  is  first  ' 

checked  whether  the  subsuming  chain  is  eliminable  by  forward 
subsumption. 

This  and  the  following  proofs  illustrate  the 
usage  of  most  forms  of  the  inference  operations.  Chains  are  ■ 

represented  as  linear  strings  of  literals  with  A-literals 
bracketed.  A-literals  represent  "opened"  goals,  i.e.,  goals  for 
vrtiich  a solution  is  currently  being  attempted  in  the  chain.  i 

B-literals  represent  "unopened"  goals,  i.e.,  subgoal  for  which 
an  attempt  for  solution  has  not  yet  started.  Each  A-literal  is  a 
logical  consequence  of  all  the  literals  to  its  right;  thus,  the 
I solution  of  each  B-literal  to  the  right  of  an  A-literal  solves 

the  A-literal  while  also  solving  all  the  other  A-literals  to  the 
right  of  the  solved  A-literal.  Deductions  are  represented  as  a 
vertical  sequences  of  chains,  the  ancestor  of  each  derived  chain 
being  the  chain  above  it.  Each  derived  chain  is  annotated  to 
describe  its  derivation  from  its  ancestor.  If  a chain  is  the 
result  of  extension  or  p-extension  by  an  assertion  with  more  than 
one  consequent  or  disjunct,  an  alphabetic  index  is  used  to 
designate  which  consequent  or  disjunct  was  used.  Indices  are 
"a",  "b",  "c",  etc.,  reading  from  right  to  left.  The  unannotated 
chains  at  the  top  of  each  sequence  of  chains  are  the  axioms. 

Here  and  elsewhere  in  this  paper,  the  conclusion  and  assertions 
of  a problem  will  frequently  be  referred  to  as  theorem  and  axioms 
respectively.  I 

This  is  a proof  that  -» (Pa  Pb)  ♦ a^b.  1 
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1 


Pa  V Pb 


first  axiom  from  -»{Pa*^Pb) 
second  axiom  from  -•  (Pa  Pb) 
theorem  to  be  established 


2.  ■'Pa  V -Pb 

3.  a^b 

4.  la/b]  -*Pb  -tPh  p- ex  tend  to  lb 


This  operation  initiates  a proof  by  contradiction. 
Assuming  a=b  (the  complement  of  the  literal  a^b) , the 
truth  of  -Pb  A -.pb  contradicts  Pa  v Pb. 


5.  [aj^b]  -•Pb  factor 


It  is  only  necessary  to  prove  -Pb  once. 


6.  [aT^bl  [-Pb]  Pa  extend  by  2a 


By  axiom  2,  if  Pa  is  true  then  -Pb  is  true. 


7.  [a^b]  I-Pb]  [Pa]  Pb  p-reduce  from  A-literal 


Again  assuming  a=b  to  derive  a proof  by  contradiction,  if 
Pb  is  true  then  Pa  is  true. 


8.  0 reduce 


In  chain  7,  subject  to  the  assumption  that  the  conclusion 
a/b  is  false  and  that  a»b,  there  are  the  implications  Pb 
Pa  and  Pa  -Pb.  This  leads  to  Pb  ♦ -Pb,  a 

contradiction  if  Pb  is  true.  Therefore,  -Pb  must  be 
assumed  to  be  true.  From  chain  5,  -Pb  ♦ a?^b,  so  the 

theorem  is  proved. 


The  following  four  examples  are  adapted  with  some  modification  from  [32] 


Example  2^15. 

1.  B - C 

2.  A A -,D  ♦ B 

3.  A ♦ C V -.D 

4.  A 

5.  C theorem 
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6. 

(Cl 

B 

extend 

by 

1 

7. 

(Cl 

(B1 

A -D 

extend 

by 

2 

8. 

[C] 

(B1 

A (-01  A -C 

extend 

by 

3a 

9. 

(Cl 

(B1 

A (-D1  A 

reduce 

10. 

(Cl 

(B1 

A 

factor 

11. 

0 

extend 

by 

4 

1.  b<a  A a£b  ♦ a=b 

2.  b>0 

3.  a<_b 

4 . b£a 

5 . a>0 

6.  (a>01  b£a  a£b  b^O 

7.  (a£oi  bja  a£b 

8.  {a>oj  b^a 

9.  ^ 


theorem 

p-extend  from  1 
extend  by  2 
extend  by  3 
extend  by  4 


Example  2^17. 


1.  a>0  ♦ a>^0 

2.  -.b>0 

3.  a>0 

4.  a^b 

5.  (a^b)  a>0  -b>0 

6.  [a/b]  a>0 

7.  0 

1.  a+b“2*c  ♦ a^b  v a=c 

2.  b^c 

3.  a+b«2*c 

4 . a^b 

5.  (a/bl  a+b»2*c  a?^c 

6.  (aybl  a+b«2*c  [a^c]  b^c 

7.  Ia?^b)  a+b«2*c 

8.  0 


theorem 
p-extend  to  1 
extend  by  2 
extend  by  3 


theorem 
extend  by  lb 
p-reduce  from  A-literal 
extend  by  2 
extend  by  3 


2.9.  .The  !Pro9ra0un^d:>Ie  .Strategy  .Theorem  Prover 

The  Programmable  Strategy  .Theorem  Prover  (PSTP)  is  a program 
written  in  .UC.I  LISP  [8]  <for  the  DECsystem-10  computer 
implementing  the  linear  MESON  procedure. 
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The  linear  MESON  procedure  is  a good  inference  system  for  an 
interactive,  programmable  strategy  theorem  prover  since,  it  being 


an  extension  of  the  problem-reduction  method,  it  is  more 
human-oriented  than  alternative  systems,  and  its  input  procedure 
nature  and  the  relatively  small  number  of  operations  that  can  be 
performed  on  any  chain  facilitates  the  design  and  use  of  the 
programmable  search  strategy  capability. 

In  addition,  the  linear  MESON  procedure  is  a suitable  choice 
in  terms  of  performance  since  it  appears  to  perform  competitively 
with  other  inference  procedures  when  used  as  the  inference  system 
in  a fully  automatic  system.  An  implementation  of  the  parent 
model  elimination  procedure  at  New  York  University  [12]  using  a 
depth  first  search  strategy  performed  competitively  with  a 
theorem  prover  employing  the  set  of  support  refinement  and  unit 
preference  search  strategy.  Further  evidence  of  the 
competitiveness  of  linear  MESON  procedure  based  systems  will  be 
presented  in  the  next  section. 

2.9.1.  Chain  properties.  In  the  design  of  a theorem  proving 
program,  it  is  necessary  to  allow  for  the  computation  and 
retention  of  certain  information  about  each  chain  (clause) 
generated  during  the  search  for  a proof.  An  example  is  the 
necessity  of  retaining  information  on  parentage  of  each  chain  so 
a proof  can  be  traced  when  discovered.  Another  example  is  the 
computation  of  the  length  of  a chain  or  the  maximum  level  of 
function  nesting  in  the  chain  if  length  or  depth  bounds  are  being 
employed.  It  would  be  wasteful  to  always  compute  and  store  such 
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information  since  it  may  not  always  be  needed.  Also,  the  : 

I 

retention  of  computed  information  about  chains  should  be  | 

contingent  on  such  variables  as  the  computational  effort  required 

i 

to  compute  the  information,  frequency  of  use  of  the  information, 
and  cost  in  memory  of  storing  the  information.  (A  more 
fundamental  objection  to  always  storing  computed  information 
about  a chain  is  that  the  information  might  change  with  time. 

i 

For  example,  the  fact  that  a particular  chain  is  the  shortest 
chain  in  memory  will  probably  be  falsified  in  the  future.) 

Another  important  consideration  in  the  design  of  an  information 
storage  and  retrieval  mechanism  for  chains  is  the  ability  to 
define  new  data  which  can  be  optionally  computed  for  any  chain. 

' This  last  consideration  is  especially  important  in  an 

interactive  theorem  proving  program  so  that  the  user  can  cause  to 
be  computed  whatever  information  about  a chain  will  be  useful  to 
him.  It  is  also  an  important  consideration  in  the  design  of  a 
theorem  proving  program  which  allows  user  specification  of  search 
strategies. 

The  property  storage  and  retrieval  mechanism  for  chains  in 
PSTP  was  designed  to  possess  the  following  characteristics.  With 
the  obvious  exceptions  of  the  number  of  a chain  and  its  ancestry 
information,  no  information  about  a chain  is  computed  unless  and 
until  this  information  is  requested.  Retaining  the  information 
is  a user  option.  A new  computable  datum  about  a chain  can  be 
defined  be  merely  defining  the  LISP  function  which  computes  the  i 

information.  ] 


I 

I 
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The  mechanism  used  is  based  on  the  concept  oi  a chain  property 
list.  This  is  a list  of  dotted  pairs;  the  first  component  of 
each  dotted  pair  is  a property  name  (the  access  name  of  a datum 
about  the  chain) ; the  second  component  is  the  value  of  the 
property.  The  information  storage  and  retrieval  mechanism 
functions  in  the  following  way.  If  the  value  of  the  property 
named,  for  example,  NLIT  (this  represents  in  PSTP  the  number  of 
literals  in  the  chain)  is  interrogated  for  a chain,  that  chain's 
property  list  is  examined  for  a dotted  pair  with  first  component 
NLIT.  If  such  a dotted  pair  is  found,  its  second  component  is 
the  desired  information.  If  property  NLIT  does  not  appear  in  the 
chain  property  list,  the  LISP  function  NLIT  is  evaluated  with  the 
chain  (including  chain  property  list)  as  its  single  argument. 

The  value  the  function  NLIT  returns  is  then  the  desired 
information.  Further,  if  the  LISP  atom  NLIT  has  non-NiL  value, 
the  property  name  NLIT  and  newly  computed  value  will  be  added  to 
the  chain  property  list. 

Nev;  properties  are  defined  by  the  DP  ("define  property") 
function.  The  DP  function  takes  as  arguments  a function  name, 
lambda  variable  list,  and  expression  (just  like  the  UCI  LISP  DE, 
DF,  and  DM  functions).  It  creates  a LISP  function  which  performs 
edl  the  chain  property  list  lookup  and  modification  operations, 
and  evaluates  expression  for  the  argument  if  the  property  value 
is  not  found  on  the  chain  property  list.  For  example,  NLIT  is 
defined  in  PSTP  by  evaluating  (DP  NLIT  (CHAIN)  (LENGTH  (CDDR 
CHAIN)))  where  LENGTH  is  the  LISP  function  which  computes  the 
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length  of  a list  and  (CDDR  CHAIN)  is  the  location  of  the  list  of 
literals  of  chain  CHAIN.  (SETQ  NLIT  T)  is  then  evaluated  to 
order  retention  of  values  computed  by  the  NLIT  function. 

Some  of  the  property  functions  already  defined  in  PSTP  compute 
the  number  of  A-literals  in  a chain  (NALIT) , the  number  of 
3-literals  in  a chain  (NBLIT) , the  total  number  of  literals  in  a 
chain  (NLIT),  the  maximum  function  depth  in  a chain  (DEPTH),  the 
number  of  variables  in  a chain  (NVAR) , the  number  of  LISP  CONS 
operations  required  to  construct  a chain  (SIZE)  (this  is  a good 
size  or  complexity  function)  , and  the  level  (number  of  inference 
operations  in  the  derivation)  of  a chain  (NEXPAND) . 

2.9.2.  Chain  filters.  This  property  storage  and  retrieval 
mechanism  supports  a higher  level  chain  storage  and  retrieval 
mechanism.  Filters  provide  a way  of  flexibly  specifying  which 
chains  are  to  be  operated  upon  and  which  derived  chains  are  to  be 
stored.  Two  types  of  filters  are  distinguished  by  usage;  input 
filters  and  output  filters.  Input  filters  are  employed  by  the 
user  to  specify  which  chains  are  to  be  operated  upon.  Only 
chains  "selected  by"  an  input  chain  filter  will  be  processed. 
Output  filters  are  used  to  specify  which  derived  chains  are  to  be 
retained.  A chain  must  be  "accepted  by"  an  output  filter  to  be 
stored.  The  general  form  for  a filter  is  a unary  LISP  function 
name  or  lambda  expression  which  returns  a non-NIL  value  if  the 
chain  argument  is  selected  or  accepted,  NIL  otherwise.  Several 
abbreviated  forms  are  also  available:  (1)  an  integer  selects  or 
accepts  a chain  with  that  chain  number,  (2)  a list  of  integers 
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selects  or  accepts  chains  with  chain  numbers  in  the  list,  (3)  a 
three  element  list  (called  a triple)  consisting  of  a binary 
function  name  and  2 integers  or  property  names  selects  or  accepts 
chains  for  which  the  value  of  the  function  applied  to  the 
integers  and  property  values  is  non-NIL,  and  (4)  a list  of 
triples  which  selects  or  accepts  chains  for  which  each  triple  has 
non-NIL  value. 

This  chain  storage  and  retrieval  mechanism  is  very  flexible. 
The  user  can  designate  chains  for  processing  directly  by  number 
or  by  the  properties  they  possess  and  can  arbitrarily  specify  the 
necessary  conditions  for  a newly  derived  chain  to  be  stored. 

This  user  specification  of  output  filters  is  a far  more  general 
form  of  the  usual  specification  of  bounds  in  theorem  proving 
pr  ograms. 

2.9.3.  Search  strategy  specification.  One  of  the  most 
important  features  of  PSTP  is  its  capability  for  specifying  the 
search  strategy  to  be  used  in  searching  for  a proof.  Several 
theorem  proving  programs  (e.g.,  QA3.6  [36])  permit  the  user  to 
specify  a particular  combination  of  refinements  of  resolution 
(restrictions  on  pairs  of  clauses  to  be  used  as  input  to  the 
resolution  operation  (e.g.,  linear,  merging,  set  of  support, 
model  refinements)),  but  the  capability  for  ordering  inference 
operations  given  a particular  refinement  of  resolution  is 
uncommon.  PSTP  is,  of  course,  restricted  to  using  the  linear 
MESON  procedure  with  variations  restricted  to  different 
postprocessing  operations,  but  it  does  have  a general  capability 
for  specifying  search  strategies. 
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Before  describing  the  search  strategy  specification  : 

capability  of  PSTP,  it  is  instructive  to  consider  the  proof 
strategy  employed  in  many  other  theorem  proving  systems  in  which 
search  strategy  is  fixed  with  possibly  a few  parameters  which  the 
user  can  specify  to  tailor  the  proof  search  to  a particular 
problem.  Thus,  the  search  strategy  may  be  fundamentally  depth 
first  or  perhaps  breadth  first  with  a parameter  specifying  the 
permitted  amount  of  look-ahead  using  unit  preference.  Much  of 
the  control  the  user  has  over  such  systems  is  the  specification 

of  which  chains  to  discard.  However,  even  this  decision  is 

i 

severely  constrained.  Usually,  the  user  is  only  permitted  to 
specify  the  values  of  a few  parameters  such  as  the  maximum  length 
, or  function  depth  of  clauses  to  be  retained. 

We  have  seen  that  output  filters  generalize  the  capability  of 
specifying  retention  of  chains.  Chain  order  functions  provide 
the  capability  of  specifying  the  order  of  expansion  of  the  search 
space. 

2.9.4.  Chain  order  functions.  Associated  with  each  list  of 
chains  is  the  name  of  an  order  function.  (The  order  function  is 
actually  a chain  property  function  as  described  above.)  Whenever 
a chain  is  stored  in  a chain  list,  it  is  inserted  according  to 
the  numerical  value  of  the  corresponding  order  function  applied 
to  the  chain.  The  chains  with  the  smallest  values  of  the  order 
function  are  stored  at  the  top  of  the  chain  list  (in  case  of 
ties,  the  more  recently  stored  chains  will  be  on  top  of  the  chain 
list)  . This  maintenance  of  chain  lists  in  sorted  order  in  j| 
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combination  with  the  SEARCH  and  SEARCH2  commands  provides  a quite 
general  capability  for  specifying  search  strategies. 

2.9.5.  Search  commands.  The  SEARCH  command  is  one  of  the 
fundamental  functions  for  automatically  expanding  the  search 
space.  The  normal  mode  of  operation  is  for  the  SEARCH  command  to 
remove  the  top  chain  from  a chain  list,  derive  all  possible 
immediate  successor  chains  from  this  chain  (this  is  known  as 
expanding  the  chain) , and  store  those  successor  chains  selected 
by  an  output  filter  in  the  original  chain  list  according  to  its 
order  function.  Thus  by  specifying  an  order  function  and  using 
the  SEARCH  command,  the  user  can  specify  in  what  order  chains  are 
to  be  expanded  and  thus  partially  control  the  search  strategy. 

For  example,  if  the  default  order  function  (which  merely  returns 
0)  is  used,  the  search  strategy  is  a depth  first  strategy.  If 
the  deduction  level  of  a chain  is  used  as  order  function,  the 
search  strategy  is  a pure  breadth  first  strategy  (level 
saturation) . The  SEARCH  command  can  be  viewed  as  an 
implementation  of  Nilsson's  A*  algorithm  135]  for  graph  searching 
as  applied  to  theorem  proving.  Each  derived  chain  is  a node  in 
the  graph  and  the  generation  of  all  immediate  successors  to  a 
chain  by  extension,  factorization,  etc.,  represents  the  expansion 
of  a node  in  the  A*  algorithm. 

Although  the  SEARCH  command  is  very  effective  in  ordering  the 
expansion  of  chains,  the  full  expansion  of  a chain  at  each  step 
often  results  in  generating  a large  number  of  chains  that  will 
not  be  used  because  the  value  of  the  order  function  for  these 
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chains  exceeds  the  maximum  order  function  value  of  any  chain 
appearing  in  some  proof.  This  presents  two  difficulties:  (1) 
unused  high  order  function  valued  chains  fill  up  memory  too 
quickly  and  (2)  their  generation  requires  extra,  unnecessary 
work.  The  first  of  these  problems  could  be  solved  by  specifying 
an  output  filter  that  rejects  chains  with  order  function  value 
exceeding  a certain  amount.  However,  this  solution  generates  a 
bounded  search  strategy,  i.e.,  a parameterized  incomplete 
strategy  which  may  fail  to  find  a proof  because  the  order 
function  maximum  is  set  too  low.  Moreover,  the  specification  of 
a bounded  search  strategy  fails  to  solve  the  problem  of  extra 
work  required  in  the  generation  of  rejected  chains. 

The  solution  adopted  for  PSTP  includes  a means  for 
specification  of  ordering  of  individual  inference  operations 
rather  than  just  chains.  The  form  of  the  value  of  the  order 
function  was  generalized  to  include  sorted  lists  of  operations 
with  numerical  values.  For  example,  the  order  function  value 
((201  REDUCE)  (302  EXTEND  6))  could  represent  the  order  function 
value  for  a chain  with  previously  unperformed  operations  of 
reduction  and  extension  by  chain  number  6.  Chains  with  order 
function  values  of  this  form  are  inserted  into  chain  lists 
according  to  the  numerical  value  of  the  first  specified  operation 
(201  in  the  example)  . 

The  SEARCH2  command  is  designed  to  operate  on  chain  lists  with 
order  functions  of  the  new  form.  The  SEARCH2  command,  rather 
than  deleting  and  expanding  the  top  chain  on  the  chain  list, 
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deletes  and  performs  only  the  first  inference  operation  of  the 
order  function  value  of  the  top  chain  on  the  chain  list.  If  any 
inference  operations  remain  in  the  order  function,  the  top  chain 
is  reinserted  in  the  chain  list  according  to  the  value  of  the 
next  specified  inference  operation.  Thus,  the  chain  list  is 
always  a list  of  chains  ordered  according  to  the  minimum  value  of 
the  unperformed  inference  operations  for  that  chain.  The  SEARCH2 
command  will  generate  successors  of  a single  chain 
uninterruptedly  only  so  long  as  none  of  the  successors  of  the 

1 

chain  or  any  other  chain  on  the  chain  list  has  an  inference 
! operation  with  lower  numerical  value  than  the  next  inference 

operation  to  be  performed  on  the  current  chain. 

The  SEARCH2  command  can  also  be  viewed  as  an  implementation  of 
Nilsson's  A*  algorithm  for  graph  searching  with  each  node  being  a 
chain  and  a single  inference  operation.  Expansion  of  a node  now 
merely  consists  of  applying  that  inference  operation  to  that 
chain. 

Using  the  SEARCH2  command,  for  example,  it  is  possible  to  use 
an  order  function  which  specifies  a depth  first  search  strategy 
that  performs  only  one  inference  operation  on  each  level  (until  a 
level  bound  is  reached  forcing  a backup)  . A more  realistic  order 
function  for  use  with  the  SEARCH2  command  was  used  in  the 
experiment  described  in  2.10.  Note  that  the  performance  of  the 
single  specified  inference  operation  may  actually  result  in  the 
generation  of  more  than  one  successor  chain  as,  for  example,  when 
there  are  two  ways  of  extending  a chain  by  a particular  axiom. 
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2.9.6.  Format  functions.  An  additional  mechanism  for  altering 
the  strategy  used  by  PSTP  is  accomplished  by  the  use  of  format 
functions.  Format  functions  can  be  used  to  reformat  (edit) 
chains  prior  to  their  storage.  For  example,  a format  function 
can  be  used  to  reorder  the  last  B-literals  of  a chain  to 
accomplish  the  effect  of  Kowalski  and  Kuehner's  literal  selection 
function  mechanism  [25].  A format  function  is  associated  with 
each  chain  list.  Each  chain  is  reformatted  according  to  the 
order  function  of  the  chain  list  into  which  it  is  being  stored 
unless  it  is  already  in  that  format.  A chain  may  be  stored  in 
two  different  formats  in  two  different  chain  lists  (formats  are 
permitted  to  alter  the  sequence  of  literals  which  constitutes  the 
chain  itself,  but  not  the  chain  property  list).  Default  format 
functions  are  those  which  convert  chains  to  x-standardized  or 
y-standardized  form  by  renaming  variables. 

2.9.7.  Command  summary.  Following  are  brief  descriptions  of 
most  of  the  PSTP  commands.  These  PSTP  commands  can  be  divided 
into  four  classes:  declarative  commands  (CHAINLIST,  PARAMETERS, 
POSTPROCESSING,  PROBLEM)  , informative  commands  (ANCESTRY,  COUI'iT, 
DISPLAY),  manipulative  commands  (COPY,  DELETE,  FOR,  TRANSFER), 
and  inference  commands  (EXPAND,  SEARCH,  SEARCH2) . An  abbreviated 
syntax  for  each  command  is  also  presented;  linguistic  variables 
are  enclosed  in  angle  brackets  (e.g.,  "<sources>")  and  optional 
command  phrases  are  enclosed  in  square  brackets  (e.g.,  "(DELETE)" 
and  "(TO  <destinations> ] ”) . If  any  phrase  of  a command  is 
absent,  a default  value  will  be  used. 
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In  the  descriptions  of  the  commands,  the  most  important 
linguistic  variables  are  <sources>  and  <destinations> . A source 
or  destination  represents,  in  general,  a chain  list  and  a chain 


filter.  For  a chain  to  be  used  by  a command  specifying 
<sources>,  it  must  be  a member  of  one  of  the  specified  chain 
lists  and  be  selected  by  the  corresponding  chain  filter.  For  a 
chain  to  be  stored  in  the  chain  list  of  a destination  by  a 
command  specifying  <destinations> , it  must  be  accepted  by  the 
corresponding  chain  filter.  Note  that  in  the  syntax,  <sources> 
and  <destinations>  are  never  optional  (except  as  part  of  an 
optional  phrase) . This  is  because  the  empty  specification  for 
<sources>  and  <destinations>  is  legal  and  has  a default  value. 

Any  of  the  non^declar ative  commands  can  be  interrupted  at  any 
time  by  typing  any  character.  The  command  completes  processing 
of  the  current  chain  and  then  enters  a "break".  In  this  state, 
the  user  can  execute  any  PSTP  command  or  LISP  function  and 
continue  or  abort  the  processing  of  the  interrupted  command. 

2.9.8.  ANCESTRY  command.  The  (ANCESTRY  [DELETE]  <sources>) 
command  prints  the  derivation  of  each  chain  designated  by 
<sources>.  If  DELETE  is  specified,  each  designated  chain  is  also 
deleted  from  its  chain  list. 

2.9.9.  CHA INLIST  command.  The  (CHAINLIST  I<declar ations> ] ) 
command  is  used  to  declare  chain  lists  that  will  be  used  and 
their  format  and  order  functions.  If  <declar ations>  is  absent, 
the  CHAINLIST  command  prints  the  list  of  previously  declared 
chain  lists  and  their  format  and  order  functions. 

94 


2.9.10.  COPY  command.  The  (COPY  (DELETE]  <sources>  [TO 


<destinations> 1 ) command  copies  each  chain  designated  by 
<sources>  to  each  of  <destinations> . l£  DELETE  is  specified, 
each  designated  chain  is  also  deleted  from  its  chain  list. 

2.9.11.  COUNT  command.  The  (COUNT  [DELETE]  <sources>)  command 
counts  the  number  of  chains  designated  by  <sources>.  If  DELETE 
is  specified,  each  designated  chain  is  also  deleted  from  its 
chain  list. 

2.9.12.  DELETE  command.  The  (DELETE  <sources>)  is  the  same  as 
the  (COUNT  DELETE  <sources>)  command,  i.e.,  the  COUNT  command 
with  chain  deletion  specified. 

2.9.13.  DISPLAY  command.  The  (DISPLAY  [DELETE]  <sources>) 
command  prints  each  chain  designated  by  <sources>.  If  DELETE  is 
specified,  each  designated  chain  is  also  deleted  from  its  chain 
list . 

2.9.14.  EXPAND  command.  The  (EXPAND  [EXTEND]  [FACTOR] 

(REDUCE]  [PEXTEND]  [PREDUCE]  (DELETE]  <sources2^>  [BY  <sourceS2>] 
(GIVING  <destinationS]^>]  (AND  <destinations2> ] ) is  the  principal 
inference  command  for  interactive  use.  It  will  perform  the 
designated  inference  operations  on  each  of  the  chains  designated 
by  <sources2>  using  each  chain  designated  by  <sources2>  as  second 
argument  to  binary  inference  operations  (extension  and 
p-extension)  . Derived  chains  will  be  stored  in  <destinations2^> 
and  lemmas  will  be  stored  in  <destinations2>  (chain  filters 


permitting).  If  DELETE  is  specified,  each  designated  chain  in 
<sourcesj^>  is  also  deleted  from  its  chain  list.  The  EXPAND 
command  is  restricted  to  performing  inference  operations  on 
chains  existing  at  the  time  of  its  invocation,  i.e.,  it  will  not 
perform  any  inference  operations  on  chains  it  has  just  derived. 
The  command  terminates  when  (1)  the  empty  chain  is  generated, 
i.e.,  a proof  has  been  found,  (2)  all  the  specified  operations 
have  been  performed,  or  (3)  the  user  suspends  processing  of  the 
command  by  typing  any  character.  If  no  inference  operations  are 
designated,  all  inference  operations  will  be  used.  If  at  least 
one  inference  operation  is  designated,  the  word  EXPAND  may  be 
omitted . 

2.9.15.  FOR  command.  The  (FOR  [DELETE]  <sources>  DO 
<function>)  command  applies  the  unary  LISP  function  <function>  to 
each  chain  designated  by  <sources>.  If  DELETE  is  specified,  each 
designated  chain  is  also  deleted  from  its  chain  list. 

2.9.16.  PARAMETERS  command.  The  (PARAMETERS  [<index>]) 
command  is  used  to  declare  the  values  of  several  global 
parameters.  If  an  <index>  (an  arbitrary  LISP  atom)  is  specified, 
it  designates  for  use  the  predefined  set  of  parameter  values 
associated  with  <index>.  If  an  <index>  is  not  specified  or 
<index>  has  no  previously  defined  meaning,  the  PARAMETERS  command 
asks  a series  of  questions  requiring  the  user  to  define  the  value 
for  each  parameter.  Parameters  set  by  the  PARAMETERS  command 
include:  whether  newly  generated  chains  are  to  be  printed,  the 
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format  in  which  chains  are  to  be  printed,  whether  lemmas  are  to 
be  generated,  and  whether  subsumption  is  to  be  performed. 


2.9.17.  POSTPROCESSING.  The  (POSTPROCESSING  I<index>l) 
command  is  used  to  declare  what  postprocessing  operation  is  to  be 
employed.  If  an  <index>  (an  arbitrary  LISP  atom)  is  specified, 
this  <index>  designates  the  postprocessing  operation  that  will  be 
used.  Allowed  <index>s  include  WEAK-SAVE,  WEAK-DELETE, 
STRONG-SAVE,  and  STRONG-DELETE,  designating  the  postprocessing 
operations  described  previously.  If  an  <index>  is  not  specified 
or  <index>  has  no  previously  defined  meaning,  the  POSTPROCESSING 
command  asks  a series  of  questions  requiring  the  user  to 
designate  which  action  among  a list  of  alternative  actions  is  to 
be  taken  for  a given  condition.  For  example,  the  POSTPROCESSING 
command  may  ask  whether,  in  the  case  of  an  A-literal  followed  by 
an  identical  B-literal,  the  B-literal  should  be  saved,  deleted, 
or  deleted  with  the  reduction  operation  recorded  in  the  ancestry 
of  the  chain. 

2.9.18.  PROBLEM  command.  The  (PROBLEM  [<declar ations> ] ) 
command  sets  up  a problem  for  the  theorem  prover.  It  first  makes 
the  chainlist  declarations  specified  by  <declar ations>  (if  fewer 
than  two  chain  list  declarations  are  specified,  up  to  two  default 
declarations  will  be  made)  and  then  asks  the  user  to  type  in  the 
theorem  and  each  axiom.  The  theorem  is  stored  in  the  first 
declared  chain  list;  its  negation  and  the  axioms  are  stored  in 
the  second  declared  chain  list.  The  input  format  for  the  theorem 
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and  axioms  is  the  same  as  was  used  in  the  description  of  the 
linear  MESON  procedure  except  that  prefix  form  for  predicate  and 
function  syr..bols  is  required,  and  (due  to  character  set 


limitations)  a and  y are  omitted  and  — > is  substituted  for 
Thus,  Pab  A Pba  ♦ a=b  v Qabx  is  typed  in  as  Pab  Pba  — > =ab  Qabx. 
The  theorem  and  axioms  are  then  encoded  into  internal  list  form. 
The  PROBLEM  command  permits  the  user  to  save  the  encoded  axioms 
so  that  they  will  not  need  to  be  retyped  in  future  proofs  of  the 
same  problem. 

2.9.19.  SEARCH  command.  The  (SEARCH  (EXTEND]  [FACTOR] 

[REDUCE]  [PEXTEND]  [PREDUCE]  <sourceSi>  [BY  <sourceS2>]  [GIVING 
<destinationsj^>]  [AND  <destinations2>]  ) repeatedly  deletes  the 
first  chain  from  <sources2>  (a  chain  with  lowest  order  function 
value)  and  performs  on  it  each  designated  inference  operation 
with  the  chains  designated  by  <sources2>  as  second  argument  to 
binary  inference  operations  (extension  and  p-extension) . Derived 
chains  will  be  stored  in  <destinations2^>  and  lemmas  will  be 
stored  in  <destinationS2>  (chain  filters  permitting) . So  that 
newly  generated  chains  can  be  used  as  input  to  inference 
operations  by  the  SEARCH  command,  <sourceS]^>  and  <destinations2> 
will  ordinarily  specify  the  same  chain  lists.  The  command 
terminates  when  (1)  the  empty  chain  is  generated,  i.e.,  a proof 
has  been  found,  (2)  <sourcesj^>  is  empty  meaning  no  more 
operations  can  be  performed  and  no  proof  could  be  found  within 
the  constraints  of  the  specified  operations,  initial  chains,  and 
chain  filters,  or  (3)  the  user  suspends  processing  of  the  command 


98 


by  typing  any  character.  If  no  inference  operations  are 
designated,  all  inference  operations  will  be  used. 

2.9.20.  SEARCH2  command.  The  (:»EARCH2  [EXTEND]  (FACTOR] 

(REDUCE]  [PEXTEND]  (PREDUCE]  <sources2>  <sources2>]  (wlVING 

<destinationsj^>]  (AND  <destinations2>]  ) repeatedly  deletes  the 

first  chain  from  <sourcesj>  (a  chain  with  lowest  order  function 

value)  and  performs  on  it  the  first  designated  inference 

operation  in  the  order  function  value.  This  inference  operation 

is  then  deleted  from  the  order  function  value  and,  if  any 

inference  operations  remain  in  the  order  function  value,  the 

* < 
chain  is  reinserted  in  <sources2>  (now  with  the  numerical  value  j 

associated  with  the  next  inference  operation  as  the  numerical  | 

value  of  the  chain  for  insertion  into  the  sorted  chain  list)  . 

I 

Derived  chains  will  be  stored  in  <destinationsi > and  lemmas  will  I 

be  stored  in  <destinations2>  (chain  filters  permitting) . So  that 

newly  generated  chains  can  be  used  as  input  to  inference 

operations  by  the  SEARCH2  command,  <sources2>  and  <destinationsi^> 

will  ordinarily  specify  the  same  chain  lists.  The  order 

function,  using  variables  of  the  SEARCH2  function,  will  construct 

a list  of  inference  operations  with  (in  the  case  of  binary 

inference  operations)  second  arguments  as  specified  in  <sources2> 

for  derived  chains  as  they  are  stored.  The  command  terminates 

when  (1)  the  empty  chain  is  generated,  i.e.,  a proof  has  been 

found,  (2)  <sourcesj^>  is  empty  meaning  no  more  operations  can  be 

performed  and  no  proof  could  be  found  within  the  constraints  of 

the  specified  operations,  initial  chains,  and  chain  filters,  or 
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(3)  the  user  suspends  processing  of  the  command  by  typing  emy 
character.  If  no  inference  operations  are  designated^  all 
inference  operations  will  be  used. 

2.9.21.  TRANSFER  command.  The  (TRANSFER  <sources>  (TO 
<destinations>] ) command  is  the  same  as  the  (COPY  DELETE 
<sources>  (TO  <destinations>] ) command,  i.e.,  the  COPY  command 
with  chain  deletion  specified. 


2.10.  PSTP  performance  study 

In  order  to  give  some  idea  of  the  performance  of  PSTP  with 
some  simple  search  strategies  and  to  make  some  points  about 
relative  merits  of  some  of  these  strategies,  the  results  of  PSTP 
runs  on  9 examples  using  4 strategies  are  presented  here. 

Results  are  compared  to  results  for  two  other  theorem  proving 
programs  tested  on  the  same  examples. 

2.10.1.  The  problems.  The  problems  are  taken  from  a 
comparative  study  of  theorem  proving  strategies  used  by  QA3.6  by 
Reboh  et  al  (36]  (additional  information  on  sources,  theory,  and 
previous  uses  of  these  excimples  are  in  (36]};  the  same  examples 
were  also  run  for  an  SL-resolution  theorem  prover  (here  called 
SLRTP)  by  Aubin  [1,2].  The  examples  are  axiomatized  just  as  for 
QA3.6  with  an  occasional  substitution  of  a disjunction  for  an 
implication  and,  in  the  cases  of  unsati sf iable  sets  of  axioms, 
the  use  of  the  negation  of  one  of  the  axioms  as  the  theorem. 
Appendix  A contains  a list  of  the  problems,  results,  and  proofs. 


2.10.2.  Inference  operations  used.  All  the  problems  were  run 
with  extension  as  the  only  rule  of  inference  except  the  NUMl 
example  for  which  reduction  was  also  necessary.  The  WEAK^OCLETE 
postprocessing  operation  was  used  for  all  the  examples.  Its  use, 
of  course,  permits  ground  factorization  and  reduction.  In  some 
examples  (BURSTALL,  SHORTBURST,  GROUPl,  GR0UP2) , it  is  readily 
apparent  from  the  structure  of  the  problem  that  no  reduction  is 
possible  (since  every  chain  derived  from  the  theorem  has  only 
positive  literals  eliminating  any  possibility  of  matching  an 
A-literal  with  a complementary  B-literal) . The  ANCESl  example  is 
propositional  and  thus  the  ground  reduction  in  the  ViEAK-DELETE 
postprocessing  operation  is  sufficient.  In  the  remaining  three 
problems  (HAS-PARTSl,  HAS-PARTS2,  PRIM)  for  which  reduction  was 
not  employed  (although  ground  reduction  was  used  in  each)  , the 
use  of  the  reduction  operation  resulted  in  the  generation  of  no 
additional  chains.  Lemmas  were  not  generated  for  any  of  the 
examples . 

2.10.3.  Search  strategies  used.  The  strategies  used  are 
characterized  by  4 parameters:  length  multiplier,  level 
multiplier,  length  maximum,  and  level  maximum.  The  length  of  a 
chain  is  defined  to  be  its  number  of  B-literals.  This  is 
consistent  with  the  notion  of  the  length  of  a clause  in 
resolution  theorem  proving  being  its  number  of  literals  since  in 
a chain  A-literals  record  ancestry  information  and  would  not  be 
present  in  the  corresponding  clause  form.  The  level  of  a chain 
is  defined  to  be  the  number  of  inference  operations  employed  in 
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deriving  it  from  the  alleged  theorem  excluding  those  operations 
(ground  factorization  and  reduction)  autc.-natically  performed  by 
the  postprocessing  operation. 

The  SEARCH2  search  command  was  employed  with  projected 
inference  operations  ordered  according  to  the  minimum  values  of  a 
weighted  sum  of  the  expected  length  and  level  of  the  result.  The 
expected  length  of  a chain  derived  by  extension  is  the  length  of 
its  parent  chain  being  extended  plus  the  length  of  the  axiom 
minus  2.  The  expected  length  of  a chain  derived  by  factorization 
or  reduction  is  the  length  of  its  parent  chain  minus  1.  The 
actual  length  may  be  less  (but  never  more)  due  to  removal  of 
B-literals  by  the  accepting  transformation.  The  expected  and 
actual  level  of  a chain  is  the  level  of  its  parent  plus  1.  Only 
inference  operations  whose  results  have  expected  lengths  and 
levels  not  exceeding  the  length  or  level  maxima  will  be  attempted 
(this  way  of  implementing  length  and  level  maxima  was  also  used 
by  QA3.6  and  SLRTP)  . 

Two^sets  of  length  and  level  multipliers  were  tried.  The 
first  has  a length  multiplier  of  101  and  a level  multiplier  of 
100  and  is  called  the  101/100  strategy.  In  the  101/100  strategy, 
the  projected  inference  operation  with  highest  merit  is  one  with 
the  smallest  value  of  (100  times)  the  sum  of  expected  length  and 
level  of  the  result.  Ties  are  resolved  in  favor  of  lesser 
expected  length  (a  100/101  strategy  would  resolve  ties  in  favor 
of  lesser  expected  level)  . (It  is  assumed  here  that  the  expected 
length  of  a chain  will  never  exceed  100.)  The  most  important 
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thing  to  note  about  the  101/100  strategy  is  that  it  is 
essentially  the  same  as  Kowalski  and  Kuehner's  upper  diagonal 
search  strategy  [25].  It  is  ar»  admissable  strategy  135]  except 
for  cases  where  the  postprocessing  operation  removes  B-literals 
by  ground  factorization  or  reduction.  First  proofs  discovered  by 
admissable  strategies  are  guaranteed  to  be  minimum  level  proofs. 

The  second  strategy  has  a length  multiplier  of  501  and  a level 
multiplier  of  100  and  is  called  the  501/100  strategy.  In  the 
501/100  strategy,  the  projected  inference  operation  with  highest 
merit  is  one  with  the  smallest  value  of  (lUO  times)  the  sum  of 
the  expected  level  and  5 times  the  expected  length  of  the  result. 
Ties  are  again  resolved  in  favor  of  lesser  expected  length.  By 
multiplying  length  by  5 times  as  much  as  level,  a strong  length 
preference  strategy  is  produced.  The  501/100  strategy  is,  of 
course,  inadmissable  since  it  Is  clearly  not  always  the  case  that 
it  requires  at  least  5 inference  operations  to  remove  a single 
literal.  (For  a strategy  to  be  admissable,  the  estimated 
additional  cost  to  solution  must  always  be  less  than  or  equal  to 
the  actual  additional  cost  to  solution.) 

The  101/100  and  501/100  strategies  were  each  tried  with 
(bounded)  and  without  (unbounded)  length  and  level  maxima.  The 
length  and  level  maxima  used  were  those  used  by  QA3.6  wherever 
possible. 
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2.10.4.  Statistics.  The  pecfocmance  of  strategies  will  be 
primarily  characterized  by  the  "chains  generated"  statistic. 

Here,  this  information  is  represented  by  a 4-tuple:  the  first 
component  is  the  number  of  chains  retained;  the  second  component 
is  the  number  of  acceptable  chains  generated;  the  third  component 
is  the  total  number  of  chains  generated;  the  fourth  component  is 
the  number  of  attempted  inference  operations.  The  number  of 
retained  chains  is  the  number  of  acceptable  chains  minus  the 
number  of  chains  eliminated  by  subsumption,  function  depth  tests, 
etc.  No  such  processes  were  used  to  eliminate  chains  in  this 
experiment,  so  the  number  of  retained  chains  is  always  equal  to 
the  number  of  acceptable  chains.  The  total  number  of  chains 
generated  is  the  number  of  acceptable  chains  plus  the  number  of 
non-acceptable  chains  generated.  These  statistics  and  the  time 
figures  referred  to  below  are  automatically  accumulated  by  PSTP 
and  printed  out  when  a proof  is  found. 

Nearly  comparable  statistics  are  presented  where  available  for 
QA3.6  and  SLRTP  (except  QA3.6  statistics  refer  to  clauses  rather 
than  chains) . Data  for  the  number  of  acceptable  chains  generated 
is  summarized  in  Table  2.1  as  well  as  being  available  in  Appendix 
A. 

Best  and  mean  performance  figures  are  presented  for  QA3.6  on 
each  example.  For  QA3.6,  the  number  of  retained  clauses  is  the 
number  of  retained  clauses  after  subsumption  and  function  depth 
tests;  the  number  of  acceptable  clauses  is  computed  as  the  number 
of  successful  resolutions  and  factorings;  the  number  of  attempted 
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Acceptable  Chainc  Generated 
unbounded  bounded  best  mean 

Problem SLRTP 


1.  BURSTALL 

191 

74 

45 

45 

42 

118 

48 

2.  SHORTS URST 

18 

16 

16 

16 

12 

21 

16 

3.  HAS -PARTS 1 
impl icative 

7 

7 

7 

7 

dis j unctive 

12 

12 

12 

12 

10 

24 

12 

4.  HAS -PARTS 2 
impl icative 

11 

11 

11 

11 

dis j unctive 

50 

38 

38 

38 

14 

51 

20 

5.  PRIM 
impl  icative 

812 

57 

70 

54 

dis  j unctive 

165 

101 

130 

101 

19 

97 

134 

6.  ANCESl 

23 

13 

13 

13 

12 

13 

14 

7.  NUMl 

10 

10 

10 

10 

10 

11 

9 

8.  GROUP 1 

14 

14 

14 

14 

7 

7 

12 

9.  GROUP 2 

576 

119 

225 

225 

74 

82 

7 

Table 

2.1 

inference  operations  is  computed  as  the  number  of  attempted 
resolutions  and  factorings.  The  proportion  of  tested  QA3.6 
strategies  which  discovered  a proof  is  given  on  the  same  line  as 
the  mean  performance  of  QA3.6  strategies;  unsuccessful  strategies 
were  excluded  in  computing  the  means. 

Performance  figures  are  presented  for  SLRTP  where  the  set  of 
support  for  the  refutation  was  the  negation  of  the  theorem  in  the 
PSTP  proof  on  each  example.  Due  to  the  similarity  of  operations 
and  terminology  between  the  linear  MESON  procedure  and  the 
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inference  system  for  SLRTP,  SL-resolution  125],  we  will  here 
present  a brief  description  of  SL-resolution. 

SL-tesolution,  a refutation  procedure,  can  be  viewed  as  a 
variant  of  model  elimination  without  equality  with  the  following 
features.  (1)  The  capability  for  reordering  B-literals  at  the 
end  of  a chain  is  formalized  in  the  form  of  a literal  selection 
function  which  designates  the  literal  to  be  extended  on  in 
succeeding  extension  operations.  (2)  Factorization  is  a required 
operation  for  completeness  in  SL-resolution  since  the  equivalent 
of  the  STRONG-SAVE  postprocessing  operation  is  employed.  The 
model  elimination  factorization  and  reduction  operations  are 
combined  into  the  SL-resolution  reduction  operation.  (3) 
SL-resolution  requires  a fully  factored  input  set  of  clauses, 
i.e.,  every  non-tautologous  factor  of  an  input  axiom  must  also  be 
input  (or,  as  in  SLRTP,  derived).  A benefit  of  this  is  that 
SL-resolution  reduction  operations  need  never  be  performed  with 
the  leftmost  involved  literal  being  or  following  the  last 
A-literal  of  the  chain.  (4)  Any  B-literal  following  the  last 
A-literal  of  the  chain  is  a candidate  for  removal  by  the 
reduction  operation,  not  just  the  rightmost  as  in  model 
elimination.  (5)  Upper  diagonal  search  is  the  prescribed  search 
strategy  for  SL-resolution. 

For  SLRTP,  the  number  of  retained  chains  is  the  number  of 
retained  chains  after  function  depth  tests  and  subsumption 
(subsumption  is  only  used  in  eliminating  redundant  axiom  chains 
or  their  factors  during  the  process  of  generating  a fully 
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factored  input  set  of  axioms);  the  number  of  acceptable  chains  is 
computed  as  the  number  of  successful  extensions,  reductions,  and 
factorings  (used  only  for  generating  a fully  factored  input  set 
of  axioms);  the  number  of  attempted  inference  operations  is 
computed  as  the  number  of  attempted  extensions,  reductions,  and 
factorings.  The  GROUPl  and  GR0UP2  example  statistics  are  taken 
from  [2]. 

The  "search  time"  statistic  represents  the  time  spent  in 
searching  for  a proof  by  a compiled  version  of  PSTP;  it  excludes 
time  spent  in  inputting  the  problem,  outputting  of  final 
statistics  and  proof,  and  garbage  collection,  although  it  does 
include  time  required  for  some  trace  output  during  the  search. 
Search  time  is  the  only  widely  variable  component  of  total  time 
to  solution  with  problem  input  and  statistics  and  proof  output 
time  relatively  constant  and  small.  Although  PSTP  is 
conservative  of  storage  (performing  LISP  CONS  operations  only 
when  necessary  when  instantiating  chains)  and  therefore 
ordinarily  requires  few  garbage  collections,  garbage  collection 
time  is  excluded  because  (1)  time  consuming  garbage  collections 
occurring  at  random  times  in  the  search  for  a proof  tend  to 
randomize  the  time  statistics  especially  for  short  searches  (this 
problem  could  be  overcome  by  always  starting  a search  for  a proof 
immediately  after  a garbage  collection)  and  (2)  frequency  of 
garbage  collection  is  dependent  on  the  amount  of  storage 
available  (with  infinite  storage,  there  need  not  be  any  garbage 
collections) . Nearly  all  the  proofs  presented  here  were  found 
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with  about  25U00  words  available  for  storing  chains  and  most  were 
found  with  no  garbage  collections. 

Time  statistics  should  not  be  used  for  comparison  among 
strategies  used  by  different  theorem  provers  without  considerable 
caution  and  more  information  than  is  usually  available.  Such 
statistics  are  of  course  influenced  by  the  machine  and  operating 
I system  used,  language  and  coding  of  the  theorem  prover , whether 

I 

compiled  (SLRTP,  PSTP)  or  interpreted  (QA3.6),  special  conditions 
applying  to  the  operation  of  the  theorem  prover  (e.g.,  tracing), 

1 

and  some  randomness  in  the  times  themselves  (such  randomness, 

f 

J attributable  to  variable  load  on  the  time  sharing  system,  is 

^ visible  in  some  anomalies  in  the  statistics  presented  here). 

I ( 

2.10.5.  Results.  Four  primary  observations  can  be  made  from 
the  results  of  the  experiment  presented  here;  (1)  PSTP  performs 
competitively  with  QA3.6  and  SLRTP,  (2)  the  501/100  strategy 
performs  better  than  the  101/100  strategy  (for  these  examples), 

(3)  the  501/100  strategy  is  relatively  insensitive  to  length  and 
I level  bounds,  the  101/100  strategy  is  much  more  sensitive,  and 

^ (4)  elimination  of  some  implicative  forms  of  the  axioms  can 

result  in  improved  performance. 

The  basis  for  comparison  of  the  results  of  PSTP  and  QA3.6  is 
the  number  of  acceptable  chains  generated  (equals  the  number  of 
chains  retained)  for  PSTP  versus  the  number  of  acceptable  clauses 
generated  (equals  the  number  of  successful  resolutions  and 
factorings)  for  QA3.6.  This  is  a fairer  comparison  than  one 
using  the  number  of  retained  clauses  for  QA3.6  since  QA3.6 
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eliminated  clauses  by  function  depth  maxima  and  subsumption. 

Even  this  comparison  is  still  somewhat  unfair  to  FSTP  since  if 
function  depth  tests  and  subsumption  had  not  been  used  in  QA3.6, 
the  number  of  generated  clauses  would  presumably  have  been  larger 
since  eliminated  clauses  could  now  act  as  parent  clauses  in 
additional  inferences. 

Using  this  basis  for  comparison,  the  unbounded  501/100 
strategy  (the  strategy  we  prefer  for  reasons  given  below) 
performed  better  than  the  average  of  QA3.6  strategies  which  found 
a proof  in  all  the  examples  except  PRIM,  GROUPl,  and  GROUP2. 

In  the  PRIM  example,  the  unbounded  501/100  strategy  performed 
only  slightly  worse  than  the  average  of  QA3.6  strategies. 

In  the  GROUPl  example,  the  absolute  difference  in  performance 
is  small  even  if  the  number  of  chains  generated  by  PSTP  is  double 
the  number  of  clauses  generated  by  OA3.6.  In  view  of  the  fact, 
for  example,  that  by  reversing  the  order  of  presentation  of  the 
axioms  to  PSTP  can  cause  the  performance  of  PSTP  to  exceed  that 
of  OA3.6,  we  tend  to  regard  this  difference  as  being  relatively 
insignificant . 

The  difference  in  the  case  of  GROUP2  is  much  more  serious  and 
has  a rather  different  explanation.  Where  the  formulation  of  the 
GROUP2  example  has  several  unit  axioms  and  two  4-literal 
associative  axioms,  the  use  of  a length  maximum  value  of  3 can  be 
seen  to  be  extremely  restrictive.  In  resolution  terms,  this 
length  maximum  requires  that  only  units  be  resolved  against  the 
associative  axioms,  and  in  the  case  of  GROUP2  if  the  negation  of 
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the  theorem  is  used  as  the  set  of  support,  the  length  maximum 
automatically  restricts  any  tested  strategy  to  a further 
refinement  of  unit  resolution  in  which  only  the  negation  of  the 
theorem  can  be  directly  resolved  against  the  associative  axioms. 
P3TP  was  tested  with  a variant  of  the  GR0UP2  example  in  which 
axiom  3 was  reordered  so  that  a proof  meeting  the  length  maximum 
value  of  3 restriction  existed.  On  this  example,  the  unbounded 
101/100  strategy  generated  435  chains;  all  the  other  strategies 
generated  29.  We  therefore  feel  that  the  better  performance  of 
QA3.6  on  this  problem  was  more  attributable  to  the  restrictive 
length  maximum  than  to  an  intrinsic  inferiority  of  PSTP. 

The  comments  about  the  restrictive  length  bound  used  by  QA3.6 
in  the  GROUP2  example  can  be  extended  to  several  other  examples. 
The  BURSTALL,  SHORTBURST,  HAS-PARTSl,  HAS-PARTS2,  PRIM,  ANCESl, 
GROUPl,  and  GROUP2  examples  all  had  very  restrictive  length 
maxima,  in  every  case  set  at  or  below  the  minimum  value  required 
for  PSTP  to  discover  a proof.  Level  maxima  were  often  similarly 
restrictive  although  we  perceive  this  to  be  much  less  important 
in  reducing  the  size  of  the  search  in  the  non-depth  first  search 
strategies  tested.  We  feel  that  use  of  such  restrictive  length 
and  level  (especially  length)  bounds  invalidates  the  results  of 
[36]  to  a degree,  since  their  use  imposes  severe  limitations  on 
the  structure  of  the  search  space.  In  this  restricted  search 
space,  tests  of  different  strategies  may  fail  to  discriminate 
between  strategies,  or  unfairly  discriminate  between  them. 

Given  the  similarity  of  SL-resol ution  and  the  linear  MESON 
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procedure,  one  would  anticipate  substantial  similarity  in  the 
results  for  PSTP  using  the  bounded  101/100  strategy  (upper 


f 


I 


diagonal  search)  and  SLRTP.  For  6 of  the  examples  (5URSTALL, 
SHORTBURST,  HAS-PARTSl,  PRIM,  ANCESl,  and  NUMl),  the  results 
agree  closely.  Differences  emerge  for  the  remaining  3 examples. 
We  don't  know  why  PSTP  did  so  much  worse  than  SLRTP  on  the 
HAS-PARTS2  example.  In  the  GROUPl  example,  use  of  a fully 
factored  input  set  of  clauses  was  clearly  beneficial  to  SLRTP 
since  the  very  short  proof  could  be  shortened  further  by  using  a 
factored  form  of  one  of  the  associative  axioms.  PSTP  with 
factorization  could  not  match  the  SLRTP  results,  since  extension 
by  the  associative  axiom  followed  by  factorization  counted  as  2 
inference  operations  in  computing  the  level  of  the  resulting 
chain  (whose  value  is  used  to  compute  the  order  function  value) 
whereas  extension  by  the  factored  associative  axiom  by  SLRTP 
counts  as  only  1 inference  operation.  (PSTP  could  be  made  to 
equal  SLRTP 's  performance  on  this  example  by  inputting  a fully 
factored  set  of  axioms,  a perfectly  legal  operation,  although 
unnecessary  for  completeness.)  In  the  case  of  the  GROUP2 
example,  for  which  SLRTP  failed  to  find  a solution,  both  the 
bounded  and  unbounded  101/100  strategies  in  PSTP  showed  relative 
difficulty  in  discovering  a solution.  In  SLRTP,  this  difficulty 
was  exacerbated  by  the  very  feature  which  aided  the  quick 
solution  of  the  GROUPl  example:  mandatory  factorization.  PSTP 
with  factorization  and  the  STRONG-DELETE  postprocessing  operation 
(resulting  in  an  inference  system  very  similar  to  SL-resolution) 


111 


failed  to  find  a proof  with  the  unbounded  lOl/lUO  strategy  after 
1507  chains  were  generated,  discovered  a proof  while  generating 
410  chains  with  the  unbounded  501/100  strategy,  and  discovered  a 
proof  while  generating  458  chains  with  each  of  the  bounded 
strategies.  The  proofs  discovered  were  the  same  as  those 
discovered  without  factorization. 

We  believe  the  detrimental  effects  of  factorization  as 
demonstrated  in  the  GROUP2  example  results  are  more  typical  than 
the  beneficial  effects  illustrated  in  the  GROUPl  example.  In  our 
experience,  even  in  cases  where  factorization  does  shorten  a 
proof  (as  it  did  not  in  the  GROUP2  example),  the  proliferation  of 
highly  instantiated  chains  caused  by  the  use  of  factorization 
still  often  outweighs  the  benefits.  (These  negative  comments 
clearly  refer  only  to  general  factorization  where  literals  must 
be  unified;  factorization  in  the  ground  case  is  clearly 
beneficial  and  is  included  in  the  postprocessing  operations  we 
used  here.)  Should  future  experience  prove  this  judgment  about 
factorization  wrong,  the  linear  MESON  procedure  still  permits 
factorization  as  a legal  though  optional  operation. 

Another  point  can  be  made  here  concerning  SLRTP's  efforts  to 
discover  a solution  to  the  GROUP2  example.  SLRTP  uses  a literal 
selection  function  to  designate  which  literal  of  each  derived 
chain  is  to  be  used  in  future  extension  operations.  The  only 
literal  selection  function  tested  was  the  function  which  always 
selects  a literal  which  has  the  fewest  matching  literals  among 
the  axioms.  This  has  the  obviously  desirable  characteristic  of 
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reducing  the  branching  rate  o£  the  search  tree  since  the  selected 
literal  has  the  fewest  matches  among  the  axioms  and,  further, 
removal  of  the  selected  literal  after  some  inference  operations 
will  usually  instantiate  the  remaining  literals  and  reduce  the 
number  of  literals  among  the  axioms  matching  them.  However,  this 
literal  selection  function  is,  in  the  case  of  problems  with 
structure  similar  to  the  GROUP2  example,  inconsistent  with  the 
use  of  length  maxima.  In  GR0UP2,  for  example,  the  literal 
selection  function  will  show  a preference  for  literals  capable  of 
being  extended  upon  only  by  the  associative  axioms  (since  any 
positive  literal  matches  the  consequent  of  the  associative 
axioms,  any  literal  matching  a unit  axiom  also  matches  the 
associative  axioms).  Thus,  the  effect  of  the  use  of  this  literal 
selection  function  is  to  increase  the  length  of  chains  appearing 
in  a deduction  possibly  requiring  the  increase  of  the  length 
maximum  used. 

One  final  point  remains  about  the  comparison  of  results 
between  PSTP  and  SLRTP.  This  concerns  the  very  small  number  of 
attempted  inference  operations  by  SLRTP.  This  is  due  to  the  use 
of  a literal  classification  tree  which  automatically  selects  out 
likely  matches  for  literals  to  be  extended  upon  from  among  the 
literals  in  the  axioms.  The  extension  operation  is  only 
attempted  for  axioms  containing  literals  selected  by  the  literal 


classification  tree.  This  probably  represents  a fairly  small 


(though  real)  saving  in  computational  effort  since  one  must  count 


the  cost  of  creating  and  accessing  the  literal  classification 
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tree  and  the  cost  saved  is  that  of  attempting  unifications 
destined  to  fail,  usually  a fairly  quick  operation.  The  real 
benefit  of  use  of  the  literal  classification  tree  is  the 
elimination  of  the  multiple  attempts  at  unifying  literals  that 
would  ordinarily  result  from  use  of  the  literal  selection 
function  requiring  discovery  of  the  number  of  matches  for  a 
literal  among  literals  in  the  axioms. 

In  comparing  the  four  strategies  tested  by  PSTP  among 
themselves,  one  first  discovers  that  the  501/100  strategy 
invariably  performed  as  well  as  or  better  than  the  101/100 
strategy  for  the  same  choice  of  length  and  level  maxima.  This  is 
especially  true  of  the  results  for  the  BURSTALL,  GROUP2,  and  PRIM 
examples  in  the  absence  of  length  and  level  maxima.  A further 
demonstration  of  the  superiority  of  the  501/100  strategy  is  its 
relative  insensitivity  to  length  and  level  bounds.  Only  in  the 
BURSTALL  example  did  the  bounded  501/100  strategy  perform 
significantly  better  than  the  unbounded  strategy.  Also,  in  the 
PRIM  and  GROUP2  examples,  the  addition  of  length  and  level  bounds 
actually  degraded  the  performance  of  the  501/100  strategy  since 
the  bounds  excluded  proofs  discovered  by  the  unbounded  strategy. 
In  contrast,  performance  of  the  101/100  strategy  was  often 
improved  by  the  addition  of  length  and  level  bounds,  but  (as 
stated  above)  never  improving  upon  the  performance  of  the  501/100 
strategy.  The  demonstrated  insensitivity  of  the  501/100  strategy 
to  the  addition  of  length  and  level  bounds  seems  especially 
significant  in  view  of  the  often  extreme  restrictiveness  of  the 


bounds  tested. 


Due  to  its  generally  good  performance  and  lack  of  improvement 


with  the  addition  of  bounds,  we  regard  the  unbounded  5U1/100 
strategy  as  the  best  among  those  tested. 

We  feel  generally  that,  provided  it  performs  adequately,  a 
complete  (e.g.,  length  preference)  strategy  like  the  unbounded 
501/100  strategy  is  to  be  preferred  to  an  incomplete  (e.g., 
length  bounded)  strategy  like  the  bounded  lOl/lUO  strategy,  even 
if  the  latter,  with  appropriate  choice  of  bounds,  can  often  match 
the  performance  of  the  former. 

Finally,  we  merely  note  that  judicious  elimination  of  various 
implicative  forms  of  the  axioms  can  result  in  significantly 
improved  performance  as  demonstrated  in  the  results  for  the 
HAS-PARTSl,  HAS-PARTS2,  and  PRIM  examples.  Of  course,  this 
elimination  of  implicative  forms  of  the  axioms  destroys  the 
completeness  property  of  the  linear  MESON  procedure.  However, 
this  controlled  incompleteness  may  be  desirable  in  cases  where 
significant  improvement  in  performance  results.  Completeness 
could  be  preserved  and  nearly  the  same  effect  gained  by 
presenting  PSTP  with  all  the  implicative  forms  of  the  axioms,  but 
(via  the  order  function  definition)  giving  PSTP  a strong 
preference  for  using  one  instead  of  another. 

One  feature  of  the  linear  MESON  procedure  not  previously 
discussed  is  the  length  of  its  proofs.  It  is  characteristic  of 
linear  theorem  proving  strategies  that  they  require  longer  proofs 
than  some  other  strategies.  This  and  past  studies  [1,12] 
indicate  that  linear  strategies  can  overcome  this  increased  proof 
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length  and  perform  competitively  with  other  procedures.  The 
linear  resolution  strategy  tested  in  QA3.6  was  less  successful 
since  the  special  chain  rejection  criteria  of  variants  of  the 
model  elimination  procedure  were  not  used. 

While  the  length  of  a proof  is  one  measure  of  its  complexity, 
we  feel  that  the  increased  length  of  linear  MESON  procedure 
proofs  is  not  a great  disadvantage  in  terms  of  readability.  The 
problem- reduction  method  oriented  form  of  such  proofs  often  makes 
them  more  comprehensible  than  ordinary  resolution  proofs  relying 
on  converging  lines  of  deduction  resulting  in  a refutation. 

2.11.  V theorem  proving  procedure 

The  * theorem  proving  procedure  is  a new  variant  of  the  MEISON 
procedures  which  is  strongly  related  to  the  definition  we  gave 
for  the  problem  reduction  method. 

Where  the  problem  reduction  method  has  a single  inference 
operation  (the  problem  reduction  operation) , the  n procedure  has 
a single  inference  operation  (the  « reduction  operation).  In  the 
problem  reduction  method,  generated  subgoals  are  solved  using 
only  the  original  assertions  of  the  problem.  In  the  t procedure, 
generated  subgoals  are  solved  using  the  original  assertions  of 
the  problem  plus  the  negation  of  the  generating  goal.  This 
modification  is  all  that  is  required  to  extend  the  problem 
reduction  method  to  completeness  for  the  propositional  calculus 
and  with  appropriate  qualification  to  the  first  order  predicate 
calcul  us . 
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The  , procedure  makes  no  assumptions  about  the  data  structure 


or  format  partial  deductions,  but  only  defines  the  single 
inference  operation  of  using  implications  to  generate  subgoals 
and  defining  the  context  in  which  these  subgoals  must  be  proved. 
The  MESON  procedures  can  be  regarded  as  forms  of  the  v procedure 
which  make  assumptions  about  the  data  structure:  trees  for  the 
MESON  procedure,  chains  for  the  linear  MESON  procedure.  They 
also  incidentally  differentiate  between  inference  operations 
using  implications  of  the  original  problem  and  those  using  higher 
goals.  In  the  * procedure,  the  two  types  of  assertions 
(assertions  of  the  original  problem  and  negations  of  higher 
goals)  are  treated  uniformly. 

We  will  treat  in  detail  only  the  propositional  calculus  form 
of  the  t procedure  with  the  understanding  that  it  can  readily  be 
lifted  to  the  first  order  predicate  calculus  by  conventional 
techniques  such  as  those  used  for  the  resolution  and  model 
elimination  theorem  proving  procedures. 

Following  is  a definition  of  the  » procedure  in  terras  of  its 
single  inference  operation,  the  » reduction  operation. 

PSli0i£i2D  2^12*  Bj^, . . . ,Bpt-,C  if 
B|^  isA^'^  ...  ♦C  (m^O) 

and  Bi,...,Bp,-'C'-*Ai 

and  Bi,...,Bp,~C^-,Ai 

and  B^, . . . ,Bp,~C'",Ajjj  for  some  k (l£k£p)  . 
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The  T procedure  is  so  named  because  (1)  in  physics  the  « meson 
is  a type  of  meson  thus  suggesting  the  relationship  to  the  MESON 
procedure,  (2)  « spelled  out  in  Latin  letters  in  w suggesting 
oremises  and  implications,  the  two  types  of  assertions  used  in 
the  MESON  procedure,  and  (3)  the  Greek  letter  « corresponds  to 
the  Latin  letter  p so  that  the  t reduction  operation  suggests  the 
problem  reduction  operation. 


?d£2£S!S  2 .4 . The  » procedure  is  sound,  i.e.,  if  . . . ,Bpt-^C 
then  B2,...,Bpt-C  where  denotes  derivability  by  the  * 
proced ure. 

Proof.  We  will  perform  induction  on  the  number  of 
applications  n pf  the  single  v procedure  inference  rule  used  in  a 
¥ procedure  proof. 

n*l.  Assume  B^, . . . , Bpi-^C  with  only  one  application  of  the 
inference  rule  used  in  the  proof.  Since  only  one  application  of 
the  inference  rule  is  required,  B|^  must  be  C itself.  Otherwise, 
additional  applications  would  be  required  to  prove  ...,  and 

A^.  Clearly  Bi , . . . ,B|^.2.»C,B|^+]^, . . . ,Bp>“C.  Therefore,  the  » 
procedure  is  sound  for  this  case. 

n>l.  Assume  the  w procedure  is  sound  for  all  proofs  with 
fewer  than  n applications  of  the  inference  rule.  Prove  it  is 
sound  for  n applications.  Assume  Bj^, . . . using  n 

applications  of  the  inference  rule. 

The  proof  of  C employs  the  subproofs  Bj^, . . . , Bp  .-C-^Aj^,  ..., 

Bj^, . . . ,Bp,~C'~,Ai  , ...,  and  B^, . . . ,Bp,~C'-^Ajn  where  Bj^  is  A^  ^... 
''Aj^  ♦ C for  some  k (l£k£p)  . By  the  induction  hypothesis,  since 
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each  o£  these  subpcoofs  has  fewer  than  n applications  of  the 
inference  rule,  , Bp  ,~C>-A2,  •••,  . . . ,Bp,~Ct-A^  , and 

Bj^, . . . , Bp  ,"'C^Ajjj.  Thus,  Bj, . . . , Bp  ,-C*-A2  . ^Ajjj  and,  by  the 

deduction  theorem,  B2^,...,Bp'”  ~C  ♦ Aj  a...  <^Ajjj.  Taking  the 
contrapositive  of  the  theorem,  we  get  B2,...,Bp»-  -*(Aj^  a...  aAjjj) 

♦ C.  But,  since  B|^  is  Aj^  a..,  ♦ C we  also  have  Bj^, . . . , Bp*-A]^ 

A...  Aft^  ♦ C.  Therefore,  , Bp'-(Aj^  a.,,  v -,(Aj^  a... 

aAjj^)  ♦ C and,  by  simplification,  Bj^, . . . , Bpi-C.  Therefore,  the  » 
procedure  is  sound  for  this  case.  □ 

In  order  to  prove  the  completeness  of  the  w procedure,  we  will 
first  prove  the  following  lemma.  The  lemma  relates  to  a 
procedure  related  to  the  « procedure  which  operates  on  clauses 
rather  than  on  implications.  Thus,  at  each  step  of  the 
procedure,  rather  than  matching  a goal  to  the  consequent  of  an 
implication  and  forming  subgoals  of  the  antecedents  of  the 
implication,  the  procedure  of  the  lemma  matches  any  literal  of  a 
clause  and  forms  subgoals  of  the  negated  remaining  literals  of 
the  clause.  By  the  definitions  of  the  v,  a,  and  ♦ connectives, 
it  can  be  readily  seen  that  the  two  processes  are  equivalent. 

The  reason  for  proving  completeness  results  in  terms  of 
clauses  rather  than  more  general  assertions  is  that  if 
implications  are  used,  two  different  but  logically  equivalent 
(meaning  they  are  represented  by  the  same  clause)  may  be 
necessary  in  the  proof.  This  need  for  different  (but  logically 
indistinguishable)  assertions  would  complicate  the  completeness 
proofs . 
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The  assertion  Aj^  a ...  a A^,  ♦ Cj  v ...  v can  be  converted 
to  the  clause  -Aj^  v ...  v - Aj„  v v ...  v Cj^. 

The  lemma  concerns  fully  unit  subsumed  sets  ot  clauses.  A 
fully  unit  subsumed  set  of  clauses  is  a set  of  clauses  in  which 
there  is  no  clause  which  is  subsumed  by  a unit  clause  in  the  set 
(except,  of  course,  for  the  unit  clause  itself) . This  is  not  a 
requirement  of  the  procedure,  but  aids  in  the  proof  of 
completeness.  The  use  of  fully  unit  subsumed  sets  of  clauses 
does  have  implications  for  some  features  of  the  procedure,  i.e., 
the  elimination  of  goals  which  are  complementary  to  higher  goals 
and  the  elimination  of  goals  which  are  instances  of  axioms,  which 
will  be  noted  later. 


^mma  2^^! . Assume  . . . ,Bp}  is  a fully  unit  subsumed  set  of 

clauses  not  including  the  clause  ~C  and  {Bu  ,...,Bi,  ,~C}  is  a 

1 •'p ' 

minimally  inconsistent  set  for  some  subset  {Bu.  ,...,Bi,  } of 

IS  1 Xp  • 

{B^, . ..,6p} . Th en  B^,..., Hp ^ ^ C . 

f£22^*  Induction  will  be  performed  on  the  number  of  non-unit 
clauses  k among  the  clauses  B2,...,Bp. 

Case  1.  k«0.  If  k*0 , then  C « {B2,...,Bp}  and  Bj^, . . . , Bp*-,C 

is  immediate  and  the  lemma  is  true  for  this  case. 

Case  2.  k>0.  Assume  the  lemma  is  true  for  all  sets  of 

clauses  with  number  of  non-unit  clauses  less  than  k.  Prove  it  is 
true  for  all  sets  of  clauses  with  number  of  non-unit  clauses 
equal  to  k. 

Assume  {Bj^,...,Bp}  includes  k non-unit  clauses. 

Case  2a.  C « {B2,...,Bp}.  B2,...,Bp^,C  is  immediate  and  the 

lemma  is  true  for  this  case. 
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Case  2b.  -C  g This  is  contrary  to  the 

assumption  of  the  lemma  and  thus  cannot  occur. 

Case  2c.  Neither  C nor  ~C  £ . . . , Bp) . Since  -C  is  in  a 

minimally  inconsistent  set  {Bi.  /""C)  by  hypothesis,  some 

ivi  IS.p, 

Bj^  . must  be  of  the  form  . . va^. 

Each  {B.,  ....,Bt  ,A4  . ,B^  ,...,B,,  j-'C)  is  inconsistent 

Ki  X Kj+l  Kp. 

(l<i'/i<m).  Further,  each  A^ i is  in  a minimally  inconsistent 
subset  of  this  set. 

Let  B be  the  set  {Bj, . . . , Bp,~C}  with  all  clauses  containing  ~C 

except  the  unit  clause  ~C  itself  removed.  B is  fully  unit 

subsumed  by  virtue  of  the  removal  of  all  clauses  containing  --C 

except  ~C  itself  and  the  fact  that  {B2^,...,Bp}  is  fully  unit 

subsumed.  The  minimally  inconsistent  set  {Bi.  ,...,Bi,  ,'~C) 

1 P' 

contains  none  of  the  clauses  subsumed  by  ~C  by  minimality.  Thus, 

the  subset  of  { Bj^^, . . . , B|^ ^ 1 '®kj+l ' * • ’ '®kpi  ^ each  A^  i which 

with  Aj^i  forms  a minimally  inconsistent  set  is  also  a subset  of  J 

B.  No  A^t  € B since  none  was  in  {Bj^,...,Bp}  since  {Bj^,...,Bp)  is 

assumed  to  be  fully  unit  subsumed  and  A^ , subsumes  Bu  . Thus  B 

D 

meets  all  the  assumptions  of  the  lemma  and  has  less  than  x 
non-unit  clauses.  By  the  induction  assumption,  Bt-^-A^i  for  each 
Aj^  I . The  proof  of  each  -A^  i constitutes  a proof  of  C and  the  ^ 

lemma  is  true  for  this  case.  □ ' 

I!i®2£S2!  2^5.  The  » procedure  is  complete,  i.e.,  if  Bj^,...,Bq 
are  consistent  and  Bj^, . . . ,Bp>-C  then  Bj^, . . . , Bpi-^C  i 

fI22l.  Let  B be  the  subset  of  Bi,...,Bp  such  that  B u {"“C)  is  1 

a minimally  inconsistent  set.  There  must  be  such  a set  since 
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are  consistent  and  imply  C.  By  the  lemma, 

Bj^, . . . ,Bp>-jjC  since  a proof  of  C from  B is  also  a proof  of  C from 

Bp } • ij 

The  1 procedure  was  expressed  in  terms  of  a single  inference 
rule  («  reduction)  with  no  limitations  on  its  application.  The 
soundness  theorem  supports  the  validity  of  the  use  of  the  « 
reduction  rule  without  restriction.  However,  the  completeness 
theorem  suggests  several  restrictions  which  can  be  used  to 
substantially  improve  the  efficiency  of  the  « procedure.  These 
restrictions  are  enumerated  below. 

In  a * procedure  deduction,  define  an  opened  goal  to  be  a goal 
to  which  1 reduction  has  been  applied  resulting  in  at  least  one 
subgoal;  define  an  unopened  goal  to  be  a goal  without  subgoals. 

1.  l4o  goal  may  have  as  a subgoal  (immediate  or  otherwise)  an 
identical  goal  (opened  or  unopened)  . This  is  equivalent  to  the 
model  elimination  admissability  rules  that  no  chain  with  an 
A-literal  followed  by  an  identical  B-literal  needs  to  be 
retained . 

The  justification  for  this  in  the  t procedure  is  that 
(examining  the  proof  of  the  completeness  theorem)  fully  unit 
subsumed  sets  of  clauses  are  used.  Thus,  if  A is  a higher  goal, 
the  completeness  theorem  employs  sets  of  clauses  no  clause 
(except  "“A  itself)  containing  "“A  in  proofs  of  the  subgoals. 

Thus,  no  V reduction  operation  can  result  in  a subgoal  of  A. 

2.  No  goal  may  have  as  a subgoal  (immediate  or  otherwise)  a 
complementary  opened  goal.  This  is  equivalent  to  the  model 
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elimination  admissabil ity  rule  that  no  chain  with  an  A-»literal 
followed  by  a complementary  A->literal  needs  to  be  retained. 

The  justification  -for  this  in  the  » procedure  is  that  the 
lower  goal  could  have  been  » reduced  by  the  negated  higher  goal. 
This  is  assumed  to  be  done,  in  some  alternate  deduction. 

3.  Uo  opened  goal  may  be  an  instance  of  an  original  premise. 
This  rule  has  no  counterpart  in  the  -formal  definition  of  the 
model  elimination  procedure  (all  admissability  rules  refer  to 
single  chains  and  take  no  notice  of  the  input  chains) , but  was 
recognized  and  successfully  used  by  Fleisig  et  al  (12]  and  in 
PSTP. 

The  justification  -for  this  in  the  * procedure  is  that  the 
opened  goal  could  have  been  « reduced  by  the  original  premise. 
This  is  assumed  to  be  done  in  some  alternated  deduction. 

4.  No  goal  may  be  an  instance  of  the  negation  of  an  original 
premise.  Tnis  rule,  though -fairly  obvious,  was  apparently 
previously  unrecognized,  and  has  only  recently  been  incorporated 
in  PSTP. 

The  justification  -for  this  in  the  « procedure  is  that 
(examining  the  proof  of  the  completeness  theorem)  -fully  unit 
subsumed  clauses  are  used.  .Thus,  if  A is  an  original  premise, 
the  completeness  theorem  .employs  sets  of  clauses  n''  clause 
(except  A itself)  -containing  A.  .Thus,  no  v ireduction  operation 
can  result  in  a subgoal  ~A.  (This  is  an  extension  of  rule  1 to 
axioms  as  well  as  negated  higher  goals.) 

Note.  Rule  4 has  the  -following  interpretation  in  terms  of  the 


model  elimination  procedure.  No  chain  with  an  A-literal  or 
B-literal  which  is  an  instance  of  a unit  axiom  needs  to  be 
retained.  If  the  literal  occurs  to  the  right  of  the  first 
A-literal  of  the  chain,  it  must  have  come  from  extension  by  some 
axiom.  But  this  (instance  of)  the  axiom  is  subsumed  by  the  unit 
axiom,  and  is  thus  unnecessary  to  a proof  (by  the  completeness  of 
the  model  elimination  procedure) . If  the  literal  occurs  to  the 
left  of  all  the  A-literals  of  the  chain,  then  the  (instance  of 
the)  top  chain  is  subsumed  by  the  unit  axiom.  Thus,  if  the  top 
chain  is  refutable,  so  is  the  unit  axiom,  and  there  exists  an 
alternate  refutation  (although  with  different  top  chain). 


2.12.  Multiple  world  states 

We  have  been  exclusively  conerned  up  to  this  point  with 
reasoning  within  a single  world  state.  That  is,  we  have 
considered  only  the  case  of  establishing  a theorem  in  the 
presence  of  some  axioms  without  considering  the  possibility  of 
the  set  of  axioms  being  impermanent  and  changeable. 

Yet  an  important  feature  of  artificial  intelligence  languages 
such  as  PLANNER  and  QA4/(3LISP  is  their  ability  to  represent 
different  world  states  by  adding  or  deleting  axioms.  Thus  we 
must  consider  the  effect  of  this  expanded  field  of  operation  on 
our  completeness  claims  for  the  , procedure  which  we  propose  as  a 
model  for  a complete  inference  system  for  artificial  intelligence 
lang uages. 
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As  demonstrated  by  Green  (171,  the  multiple  world  state 


situation  can  be  handled  directly  within  the  first  order 
predicate  calculus  (and  thus  by  the  unmodified  -g  procedure)  by 
the  use  of  state  variables  in  state-dependent  predicates. 

ive  will  here  distinguish  three  kinds  of  reasoning:  reasoning 
within  a world  state,  reasoning  between  world  states,  and 
reasoning  about  world  states. 

Reasoning  within  a world  states  corresponds  to  the 
conventional  theorem  proving  problem  and  is  directly  handled  by 
the  » procedure. 

By  reasoning  between  world  states,  we  mean  the  typical 
operation  of  PLANNER  and  QA4/QLISP  by  which,  in  order  to  achieve 
a goal,  some  operation  is  performed  which  results  in  a different 
state  by  the  addition  or  removal  of  axioms.  For  example,  the 
goal  [inroom  personl  , rooml]  may  be  achieved  by  the  walk 
operator  which  removes  [ in room  personl  , x]  for  all  x and  adds 
(inroom  personl  , rooml].  If  this  is  only  one  component  of  a 
larger  task,  the  solution  of  the  larger  task  would  continue  in 
this  altered  world  state.  Reasoning  between  world  states 
includes  explicit  transformations,  usually  corresponding  to 
physical  actions,  which  transform  one  state  to  another. 

Aside  from  the  state  modification  which  takes  place  when 
assertions  are  added  to  or  deleted  from  the  data  base  as  a result 
of  successful  application  of  a world  state  transformation,  the 
principal  effect  of  reasoning  between  world  states  is  the  aosence 
of  a need  to  assert  the  negation  of  a goal  (to  invoke  the 
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contradiction  mechanism)  if  the  goal  is  to  be  achieved  (producing 
a new  world  state)  rather  than  merely  inferred  (proved  in  the 
current  world  state)  . The  reason  for  this  is  that  the  predicate 
calculus  state  variable  description  of  the  state  to  be  achieved 
cannot  be  unified  with  the  state  variable  of  any  predecessor 
state.  Hence,  no  complemented  lower  goal  can  be  discovered. 

This  fact  is  further  justification  for  the  already  useful 
distinction  between  the  achieve  and  infer  goal  functions  of  n 
Popler  1.5  110].  The  infer  type  goal  tries  to  prove  the  goal  in 
the  present  world  state  and  would  need  to  (using  the  » procedure) 
check  for  complementarity  of  higher  and  lower  subgoals.  The 
achieve  type  goal  which  tries  to  produce  the  goal  would  not  need 
to  perform  this  checking. 

The  third  kind  of  reasoning  with  world  states,  reasoning  about 
world  staes,  may  involve  the  creation  of  hypothetical  world 
states  which  are  then  reasoned  about,  or  determining 
relationships  between  different  world  states.  Hewitt  [22] 
suggests  the  example  of  comparing  the  weight  of  Neil  Armstrong  on 
the  earth  (context4)  and  on  the  moon  (context^).  Present 
artificial  intelligence  languages  are  rather  weak  in  conveniently 
allowing  such  comparisons,  and  Hewitt  suggests  the  alternative 
of,  rather  than  storing  data  in  contexts,  putting  explicit 
context  tags  (called  situational  tags)  in  the  data.  This  is 
clearly  a general  and  complete  solution  to  the  problem  of 
relating  the  data  to  the  world  state. 

A curiosity  unmentioned  by  Hewitt  is 
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that  his  use  of 


situational  tags  (though  presented  with  different  rhetoric  and 
motivated  by  the  example  of  QA4/QLISP  contexts)  constitutes  a 
return  to  the  concept  of  Green's  predicate  calculus  state 
variables  from  which  situational  tags  are  essentially 
indistinguishable.  The  treatment  of  situational  tags  in  the 
fashion  of  predicate  calculus  state  variables  permits  the  « 
procedure  to  be  used  for  this  reasoning  about  world  states. 


2.13.  Equality 

In  analogy  to  the  MESON  procedure  for  equality,  there  is  a * 

I 

procedure  with  an  equality  inference  rule.  In  addition  to  the  * 
reduction  inference  rule,  the  » procedure  for  equality  includes  a 
single  additional  inference  rule,  the  * substitution  rule. 
Substitution  from  a higher  goal  (A-literal)  is  handled  by  i 
substitution  from  a locally  asserted  negated  higher  goal  just  as 
reduction  from  a higher  goal  is  handled  by  » reduction  from  a 
locally  asserted  negated  higher  goal. 

One  change  that  is  made  to  the  organization  of  the  * procedure 
in  preparation  for  use  of  « substitution  is  that  now  the  negated 
goal  is  asserted  before  any  inference  operation  is  attempted  on 
the  goal,  not  after  as  was  possible  when  only  » reduction  was 
used.  This  is  to  allow  for  substitution  of  a goal  to  itself 
(ME-P  p-reduction  to  self)  . 

The  following  defines  the  « substitution  inference  rule. 

2.31.  B3^,...,Bp  if 

B,^  or  -C  is  ♦ C'  {m>0) 
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and  . . . ,Bp,-C  (defined  below) 

and  , Bp  ,~C 

and  Bj^,  , , . , Bp ,~C  “"wAi 

and  Bj^,  . . . , Bp  ,~C  *",Ajj  for  some  k (l<k<p)  . 

Case  I.  ( n substitution  from  assertion)  If  C is  of  the  form 
a=b  or  b=a  and  C has  an  instance  of  term  b then  D is  C with  a 
single  instance  of  b replaced  by  a. 

Case  II.  (w  substitution  to  assertion)  If  C is  of  the  form  a/b 
or  b/a  and  C has  an  instance  of  term  b then  D is  -C ' with  a 
single  instance  of  b replaced  by  a. 

The  following  theorem  extends  our  previous  soundness  proof  for 
the  * procedure  without  equality  to  the  » procedure  with 
equality. 

2.6.  The  » procedure  with  equality  is  sound,  i.e.,  if 
B2,...,Bp^,C  then  B^,  . . . , Bp*“C. 

f£22f*  ''■6  will  perform  induction  on  the  number  of 

applications  n of  the  « reduction  and  « substitution  inference 
rules  used  in  a t procedure  proof. 

n*l.  Assume  B^, . . . , Bp*-,C  with  only  one  application  of  an 
inference  rule  in  the  proof.  Since  only  one  application  of  an 
inference  rule  is  required,  ir  reduction  must  be  used  with  B|^ 
being  C itself.  Otherwise,  additional  inferences  would  be 
required  to  prove  Aj^,...,Ajjj  in  the  case  of  a non-premise 
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application  of  « reduction,  or  to  prove  and  b in  the 

case  of  « substitution. 

n>l.  Assume  the  « procedure  is  sound  tor  all  proofs  with 
fewer  than  n applications  of  inference  rules.  Prove  it  is  sound 
for  n.  Assume  . . . , Epi-,C  with  n applications  of  inference 
rules . 

Case  1.  The  case  where  the  « reduction  rule  is  the  top 
inference  rule  application  is  already  proved  for  the  soundness 
theorem  for  the  » procedure  without  equality. 

Case  2.  The  case  where  the  » substitution  rule  is  the  top 
inference  rule  application  employs  the  subproofs 

B^,...,6p,^C^^0,  3^,..., Bp, ^C^ yA^f  .**f  B^,..., Bp , C^ ^A^,  ..., 

and  Bj, ,Bp,~C^-i,A^  where  B|^  or  ~C  is  Aj^  ...  ''Ajj,  * C for 

some  k (l^k£p)  . By  the  induction  hypothesis,  since  each  of  these 
“*  1 
subproofs  has  fewer  than  n applications  of  inference  rules, 

B ^ , . . . , B^, . . . , ...,  B^,..., ..., 

and  Bj, . . . ,Bp,~C>-Ajj,.  Therefore,  B^, . . . Bp-C-D  a a ...  a Ajj, 
and,  by  the  deduction  theorem,  B2,...Bp^-  ~C  ■»  D a a^  a ...  a Ajj, 
and  Bi,...,Bp'-  ~(D  a Aj  a ...  a Aj^)  ♦ C. 

Case  2a.  Consider  first  the  case  where  is  A^  a ...  a a^  * 
C,  i.e.,  the  case  where  C is  not  -C.  Then  B^, . . .Bpi-Aj^  a ...  a 
Ajj,  ■*  C.  From  this  plus  the  previous  fact  that  B2,...,Bp'“  - (D  a 
^1  •••  ''  ♦ C we  can  infer  that  B2*...»Bp^*  -i  (D  a c ' ) ♦ C. 

There  are  two  possible  cases  corresponding  to  cases  1 and  II  of 
the  » substitution  inference  rule  to  be  considered;  (1)  C is 
a>b  or  b>a,  C is  C(b)  , and  C is  C(a),  and  (2)  C*  is  C(b),  C is 
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a^b  or  b^a,  and  D is  -CCa) . we  will  construct  a truth  table  for 


the  values  of  C,  D,  C , and  ->  (D  a C ' ) ♦ C. 

c D j 

I 

T T T ! 

j 

TFT  j 

F T T I 

F F T 

■| 

T F F ! 

F T F I 

F F F I 

j 

We  need  only  consider  the  lines  in  which  -» (D  a C ' ) ♦ C is  tr ue 

since  we  know  this  to  be  a theorem  provable  from  In 

each  such  line  except  the  fifth,  C has  value  true  and  is  thus 
provable  from  satisfying  the  requirements  of  the 

theorem.  In  case  (1),  line  5 states  the  following  are  true: 

-iC(b),  C(a),  and  a»b  which  is  inconsistent  with  the  standard 
interpretation  of  equality.  In  case  (2),  line  5 states  the 
following  are  true:  a»b,  ~C(a),  and  C(b)  which  is  likewise 
inconsistent  with  the  standard  interpretation  of  equality.  Tnus, 

C is  provable  everywhere  -*(D  a c*)  ♦ C is  satisfying  the 
statement  of  the  theorem. 

Case  2b.  Now  consider  the  case  where  C is  ~C.  If  82^,...,  Bp 
are  inconsistent,  then  Bj:,...,Bp^C  immediately.  If  B2,«*.,Bp  | 

are  consistent,  and  we  have  proved  B2,...,Bp*-  '•(D  a Aj^  a ...  a 
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A^)  -»  C which  reduces  to  ~D  ♦ C,  or  B , . . . , Bp>-D  v c. 

If  D is  false,  then  D must  be  true,  and  B , . . . , Ep>-D.  If  D is 
true,  assume  C is  false.  Then,  by  substitution  of  equals  lor 
equals,  since  ~C  is  of  the  form  a=b  and  D is  derived  from  C by 
substituting  a for  b,  C and  D are  equivalent.  Contradiction. 
Therefore  , . . . , Bp‘-C.  U 

The  w procedure  for  equality  can  be  seen  to  be  complete  by 
analogy  to  the  MESON  procedure  for  equality,  and  ME-P  from  which 
both  are  derived. 

2.13.1.  Marking  and  other  restrictions.  in  addition  to  the 
problem  of  equality  inference  rules  producing  a great  many 
consequences,  there  is  the  additional  problem  that  they  tend  to 
produce  the  same  consequences  in  many  different  ways, 
representing  different  permutations  in  the  order  in  which 
operations  are  performed.  Such  multiple  derivations  are  often 
eliminated  in  the  case  of  theorem  proving  without  equality  by  us' 
of  ordering  restrictions.  Some  of  the  restrictions  we  present 
here  for  equality  have  the  flavor  of  extending  ordering 
restrictions  to  equality  inference  rules. 

Several  completeness  preserving  restrictions  on  the  use  of  » 
substitution  can  be  used  to  eliminate  some  of  the  multiple 
derivations  of  terms. 

Restriction  The  first  restriction  consists  of  requiring  » 

substitution  from  an  axiom  (an  assertion  in  the  original  set  of 
assertions  as  opposed  to  an  assertion  which  is  the  negation  of  a 
higher  goal)  to  substitute  the  first  term  for  the  second  and  to 
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never  substitute  the  second  for  the  first.  Schematically,  the 


axiom  a*b  or  a . . . a Ajj,  ■*  a“b  will  be  permitted  to  substitute 
(an  instance  of)  term  a for  term  b in  some  literal,  but  not  term 
b for  term  a.  Clearly,  preservation  of  completeness  requires 
that  both  substitution  possibilities  be  available.  This  is  done 
by  including  in  the  axioms  both  a=b  and  b=a  if  a differs  from  b. 
Since,  in  general,  both  variants  are  required  for  completeness  in 
every  case  where  a differs  from  b,  this  restriction  does  not 
decrease  the  number  of  operations  that  can  be  performed  except  in 
the  case  where  a is  identical  to  b.  In  the  latter  case,  v 
substitution  by  a>a  (e.g.,  « substitution  by  functionally 
reflexive  axiom  f{xy)=f(xy))  will  produce  fewer  identical  results 
since  it  will  now  only  try  to  substitute  the  left  f(xy)  for  the 
right  f(xy)  and  not  vice  versa.  Aside  from  this  reduction  in  the 
number  of  operations  in  the  case  where  a is  identical  to  b,  the 
principal  advantage  of  this  restriction  is  that  the  two 
substitution  possibilities  (a  for  b,  and  b for  a)  can  be 
distinguished  and  differentially  employed  in  the  search  for  a 
solution.  For  example,  in  group  theory  problems,  the  axiom 
x>f(xe)  (X  equals  x times  the  right  identity  e)  can  be  used  in 
two  ways:  replacing  a term  of  the  form  f(ae)  by  a or  replacing  a 
term  of  the  form  a by  f(ae)  for  arbitrary  a.  Since  the  first 
possibility  is  a case  of  simplifying  the  expression,  and  the 
second  makes  the  expression  more  complex,  the  first  usage  of  the 
axiom  x«f(xe)  is  generally  preferred  to  the  second.  The  two 
possibilities  are  readily  distinguished  using  this  restriction  by 


r 

v.’riting  both  axioms  x=£(xe)  and  £(xe)>x.  Since  systems  such  as 
PSTP  operate  on  chains  (clauses,  axioms,  etc.)  more  than  on 
literals  or  terras,  the  alternative  substitution  possibilities  are 
very  readily  distinguished  by  using  this  restriction. 

The  choice  of  whether  axiom  a=b  is  to  represent  the 
substitution  of  a for  b,  or  b for  a,  is  clearly  an  arbitrary  one 
since  either  convention  would  work  equally  well.  The  convention 
used  (namely,  that  a>b  denotes  the  substitution  of  a for  b)  was 
chosen  for  its  mnemonic  value.  Just  as  w reduction  by  A •»  B 
establishes  as  a subgoal  of  B the  goal  A,  a«b  establishes  as  a 
subgoal  of  a literal  containing  b a goal  containing  a.  Thus,  a>b 
is  rather  like  A ♦ 3 only  it  operates  on  terras  rather  than 
literals. 

i*  The  second  restriction,  as  opposed  to  the 
first,  has  a substantial  impact  on  reducing  the  number  of 
alternate  derivations  of  terms  by  « substitution.  This 
restriction  results  from  the  observation  that  structure  is 
introduced  into  a term  by  two  different  methods.  The  first 
method  is  « substitution  from  a functionally  reflexive  axiom. 

This  is  the  standard  structure  introducing  operation,  required 
for  completeness  as  shown  by  the  theorem  lifting  completeness  of 
the  V procedure  with  equality  from  the  ground  level  to  the 
general  level.  The  second  method  involves  i substitution  from  an 
axiom  which  is  not  a functionally  reflexive  axiom  which 
instantiates  a variable'of  the  term  being  substituted  into. 

An  example  of  each  method  involves  derivations  from  the  term 
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£(xa).  , substitution  by  the  functionally  reflexive  axiom 

f(xy)»f{xy)  results  in  f(f(xy)a),  replacing  the  x of  the  original 
term  by  f(xy)  introducing  the  additional  structure  f(...).  w 
substitution  by  the  associativity  axiom  f ( xf { yz) ) =f ( f ( xy) z) 
results  in  f(xf(ya)),  replacing  the  x of  the  original  term  by 
f(xy)  introducing  the  additional  structure  f{...)  (just  like  * 
substitution  by  the  functionally  reflexive  axiom)  and  then 
rearranging  (parentheses  of)  the  term. 

Note  that  the  terra  £(xf(ya))  derived  by  « substitution  by  the 
associativity  axiom  is  also  derivable  by  first  v substituting  by 
the  functionally  reflexive  axiom  and  then  « substituting  by  the 
associativity  axiom.  Thus,  there  are  two  alternate  derivations 
of  f (xf (ya) ) . 

The  purpose  of  this  restriction  is  to  eliminate  one  of  these 
derivations  by  restricting  structure  introducing  operations  to 
the  functionally  reflexive  axioms. 

The  restriction  is  that,  except  in  the  case  of  « substitution 
by  a functionally  reflexive  axiom,  no  variable  of  the  term  being 

I 

1 substituted  into  may  be  instantiated  by  the  « substitution 
operation  to  a term  which  is  neither  a variable  nor  a constant. 
Any  « substitution  operation  which  did  so  instantiate  a variable 
must  be  performed  on  the  modified  term  after  a sequence  of  * 
substitution  operations  by  functionally  reflexive  axioms. 

As  well  as  decreasing  the  number  of  alternative  derivations  of 
a term,  this  restriction  like  the  previous  one  has  some  heuristic 
value  since  it  makes  operations  which  introduce  structure  (* 
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substitution  from  functionally  reflexive  axioms)  and  operations 
which,  for  example,  rearrange  structure  such  as  associativity  and 
commutativity  readily  distinguishable. 

Restriction  The  third  restriction,  the  marking 
restriction,  is  the  most  sophisticated  of  the  3 restrictions 
presented  here.  It  is  an  attempt  to  achieve  the  same  effect  of 
reducing  the  number  of  alternate  derivations  when  equality 
inference  rules  are  used  as  is  achieved  by  chain  or  clause 
ordering  rules  in  the  absence  of  equality. 

Chain  or  clause  ordering  rules  impose  restrictions  on  the 
order  in  which  literals  may  be  resolved  upon,  depending  on  their 
lexicographic  value  (A-ordering)  or  their  inherited  position 
(C-ordering,  used  in  the  model  elimination  derived  procedures). 
The  marking  restriction  imposes  an  ordering  restriction  on 
substitution  operations  on  terms. 

Carelessly  stated,  the  marking  restriction  prohibits  a 
substitution  operation  on  a term  lexically  to  the  left  of  a term 
previously  substituted. 

An  illustration  of  the  potential  effectiveness  is  the 
following . 


2j,19. 


1.  Pabc 

2.  a«d 

3.  b«e 

4.  c» f 

5.  Pdef 


theorem 


There  are  6 proofs  of  the  theorem,  varying  only  in  the  order  in 
which  substitution  from  the  axioms  is  performed. 


6.1.  (Pdef)  Paef 

7.1.  [Pdef]  IPaef)  Pabf 


« substitution  from  2 
« substitution  from  3 
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8.1. 

[Pdef] 

(Paef  ] 

(Pabf] 

Pa  be 

W 

substitution 

from 

4 

9.  1. 

V 

reduction  by 

1 

6.  2. 

(Pdef] 

Paef 

tl 

substi tution 

from 

2 

7.  2. 

(Pdef  i 

(PaefJ 

Paec 

V 

substi tution 

-from 

4 

b.  2. 

(Pdef] 

(Paef] 

(Paec] 

Pabc 

ir 

suosti tution 

-from 

3 

9.  2. 

f 

reduction  by 

1 

6.  3. 

[Pdef] 

Pdbf 

It 

substi tution 

from 

3 

7.3. 

(Pdef] 

(Pdbf] 

Pabf 

It 

substi tution 

■from 

2 

8.3. 

(Pdef] 

(Pdbf] 

(Pabf] 

Pabc 

V 

substitution 

from 

4 

9.  3. 

0 

ir 

reduction  by 

1 

6.4. 

(Pdef] 

Pdbf 

T 

substi tution 

from 

3 

7.  4. 

(Pdef] 

(Pdbf] 

Pd  be 

W 

substi tution 

-from 

4 

8.  4. 

(Pdef] 

(Pdbf] 

(Pdbc] 

Pabc 

IT 

substitution 

from 

2 

9.4. 

0 

IT 

reduction  by 

1 

6.5. 

(Pdef] 

Pdec 

It 

substitution 

■from 

4 

7.5. 

(Pdef] 

(PdeC] 

Paec 

rt 

substi tution 

from 

2 

8.5. 

(PdefJ 

(Pdec] 

(Paec] 

Pabc 

¥ 

substi tution 

from 

3 

9.  5. 

0 

¥ 

reduction  by 

1 

6.  6. 

(Pdef] 

Pdec 

¥ 

substi tution 

■from 

4 

7.  6. 

(Pdef] 

(Pdec] 

Pdbc 

¥ 

substi tution 

■from 

3 

8.  6. 

(Pdef) 

(Pdec] 

(Pdbc] 

Pabc 

¥ 

substitution 

from 

2 

9.  6. 

0 

j 

¥ 

reduction  by 

1 

Thus 

, there 

are  6 

occurrences  of 

: the  empty  chain  at 

level  4 

24  derived  chains  in  all  (disregarding  the  possibility  of 
subsumption  being  used  to  eliminate  identities). 

.If  the  marking  restriction  is  used,  then  the  only  totally 
acceptable  proof  is  the  .first  one  since  it  is  the  only  one  that 
substitutes  for  the  terms  d,e,f  in  strict  left  to  right  order. 
Formation  of  the  following  -chains  is  prohibited  by  the  marking 
restriction  since  each  requires  substi tution -for  a term  to  the 
•left  of  a previously  substituted  term:  8.2,  7.3,  8.4,  7.5,  7.6. 
Thus,  at  level  4 there  is  only  1 occurrence  of  the  empty  chain 
and  11  derived  -chains  in  all,  a 54%  reduction. 

We  will  now  present  the  marking  restriction  in  greater  detail.. 
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1.  (marked  term  usage  restriction)  Each  term  in  the  last 
literal  of  a chain  may  be  marked.  No  j substitution  operation 
from  a higher  goal  (A-literal)  or  from  an  axiom  can  be  performed 
on  a marked  term  directly.  For  example,  if  c=d  is  an  axiom  and 
d'  a term  of  the  last  literal  of  the  chain  where  d and  d'  are 
unifiable,  no  marked  occurrence  of  d’  in  the  literal  can  be  w 
substituted  by  c^d.  Marked  occurrence  of  d'  may  be  used  in  » 
substitution  operations  from  higher  goals  or  axioms  if  they  are 
subterms  of  the  unmarked  term  being  matched  in  the  t substitution 
operation . 

2.  (acquisition  of  marks)  Each  term  totally  to  the  left  of  the 
term  matched  in  a « substitution  operation  from  a higher  goal  or 
axiom  is  marked.  If  a term  is  marked,  so  are  all  its  substerms. 

3.  (retention  of  marks  of  unmatched  terms)  Marks  are  retained 
by  all  marked  terms  not  matched  in  a « substitution  operation. 

If  the  marked  term  is  a variable  which  is  instantiated  by  the  w 
substitution  operation,  its  value  is  marked  at  the  marked 
occurrence . 

4.1.  (retention  of  marks  of  matched  terms)  The  conservative 
approach  is  to  eliminate  all  marks  of  terms  matched  in  a i 
substitution  operation  from  higher  goal  or  axiom.  This 
conservative  marking  restriction  still  imposes  a left  to  right 
order  in  performing  w substitution  operations  on  disjoint  terms, 
but  wipes  out  all  marks  when  a superterm  is  v substituted.  An 
illustration  of  why  this  approach  is  too  conservative  relates  to 
the  use  of  commutative  and  similar  axioms.  Every  term 
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immediately  preceded  by  *' ! “ or  contained  in  a term  immediately 
preceded  by  " ! " is  marked. 

2^20. 

1.  Pf(ab) 

2.  a=c 

3.  b=d 

4.  f(xy)»f(yx) 

5.  Pf(dc)  theorem 

6.  Pf{!da)  ^ substitute  from  2 

7.  Pf(ad)  , substitute  from  4,  destroy  marks 

d.  P£(!ab)  « substitute  from  3 

9.  0 t reduce  by  1 


A consequence  of  destroying  the  marks  in  chain  7 using  the 
conservative  approach  is  that  the  marking  restriction  had  no 
effect  in  reducing  the  number  of  possible  proofs.  It  is 
desirable  to  strengthen  the  marking  restriction  to  allow  for 
inheritance  of  marks  through  » substitution  operations. 

We  illustrate  this  idea  using  the  same  example.  Here,  terms 
which  are  marked  when  matched  to  the  variables  y and  x of  f(yx) 
of  axiom  4 retain  their  marks  when  substituted  in  f(xy). 

Example  2^21. 


5.  Pf(dc) 

6.  Pf(!da) 

7.  Pf(aJd) 


theorem 

« substitute  from  2 

X substitute  from  4,  marks  inherited 


Chain  8 can  no  longer  be  derived  since  it  would  required  « 
substitution  on  the  marked  term  d.  Remaining  valid  deductions 
are 


5.  Pf(dc) 

theorem 

6.  Pf(bc) 

« substitute  from 

3 

7.  Pf(cb) 

« substitute  from 

4 

8.  Pf(ab) 

» substitute  from 

2 

9.  ^ 

« reduce  by  1 

5.  Pf(dc) 

theorem 

6.  Pf(cd) 

V substitute  from 

4 

7.  Pf(ad) 

» substitute  -from 

2 

8.  'Pf  ( lab) 

» substitute  'from 

3 
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9.  0 » reduce  by  1 

Inheritance  of  marks  is  not  always  so  str aiqhbforward  as  is 
suggested  oy  the  example  using  the  commutative  axiom.  Freely 
inherited  marks  may  result  in  blocking  all  proofs. 

Example  2.22. 


1.  Pg{de) 

2.  c=b 

3.  g(xx)=>f(xc) 

4.  d=a 

5.  e*a 


6.  Pf(ab) 

theorem 

7.1.  Pf  ( ! ac ) 

¥ 

substi tute 

from 

2 

7.2.  Pg(!a!a) 

¥ 

substi tute 

■from 

3 

(blocked) 

8.1.  Pf(db) 

¥ 

substi tute 

from 

4 

8.2.  Pf(!dc) 

¥ 

substi tute 

from 

2 

8.3.  Pgddld) 

¥ 

substitute 

■from 

3 

(blocked) 

9.1.  Pf(eb) 

¥ 

substitute 

from 

5 

9.2.  -Pfdec) 

¥ 

substi tute 

■from 

2 

9.3.  Pg(!ele) 

¥ 

substi tute 

from 

3 

(blocked ) 

The  problem  here  is  that  b must  be  w substituted  to  c in  order  to 
apply  necessary  axiom  3.  a,  or  whatever  a has  already  been  » 
substituted  to,  is  then  marked.  When  two  copies  of  it  are 
formed,  both  are  marked  prohibiting  the  » substitution  of  each, 
eliminating  any  possibility  of  » reduction  by  1. 

4.2.  (better  retention  of  marks  of  matched  terms)  It  is 
apparent  that  marks  may  be  validly  retained  when  there  is  only 
one  occurrence  of  the  variable  in  the  left  side  of  the  equality 
(such  as  in  the  case  of  the  commutative  axiom).  Thus,  marks  may 
be  transmitted  through  each  variable  of  the  equality  for  which 
there  is  only  one  ocurrence  of  the  variable  in  the  left  side. 
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The  problem  of  the  most  recent  example  refuting  the  general 
transmission  of  marks  through  variables  of  the  equality  can  be 
overcome  by  specifying  that  marks  may  be  transmitted  through  at 
most  one  occurrence  of  e<  ch  variable  in  the  left  side  of  the 
equality.  A consistent  convention  (such  as  transmitting  marks 
only  through  the  leftmost  occurrence  in  the  left  side)  must  be 
adopted.  Thus,  in  the  previous  example,  only  one  of  the  two  a' s 
of  7.2,  one  of  the  two  d's  of  8.3,  and  one  of  the  two  e‘s  of  y.3 
would  be  marked.  This  would  allow  (assuming  the  leftmost 
variable  occurrence  retains  marks)  6.3  (by  , substituting  first 
a=d,  then  e=a)  to  be  extended  to  a complete  proof.  Both  other 
deductions  remain  blocked.  This  lengthened  proof  (longer  than 
the  shortest  proof  with  no  marking  restriction)  can  be  shortened 
by  specifying  no  variable  occurrence  in  the  left  side  of  the 
equality  transmits  marks  i'f  thet.e  is  more  than  one  occurrence  of 
the  variable  in  thet'left  side.  However,  the  cost  of  discovering 
such  lengthened  proofs  should  generally  be  overcome  by  the 
lowered  cost  o>f  searching  -for  a proof  provided  by  the  stronger 
marking  restriction.  Note  however  that  this  stronger  marking 
restriction  may  require  the  use  of  equality  substitutions  (a=d  in 
the  example)  not  otherwise  necessary  and  more  equality  literals 
may  need  to  be  included  if  restriction  1 is  used.  A more  general 
comment  is  that  transmission  of  marks  through  one  of  the  n>l 
variable  occurrences  of  the  left  side  reauires  the  symmetry  of 
substitution,  i.e.,  if  a can  be  substituted  to  b then  b -can  be 
suostituted  to  a. 
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An  inefficient  form  of  the  marking  restriction  was  implemented 


in  an  earlier  version  of  PSTP  by  Don  Cohen  in  June  l‘<75.  It  was 
tested  on  an  .equality  based  formulation  of  tne  group  theory 
problem:  if  the  square  of  every  element  is  the  identity  element 
then  the  group  is  Abelian.  Several  equalities  are  given  twice, 
once  in  reversed  order,  since  restriction  1 was  used. 

1.  c=f(ab) 

2.  e»f(xx) 

3.  x*f(xe) 

4.  x=f(ex) 

5.  f(xx)=e 

6.  >f(xe)=x 

7.  <f(ex)=x 

8.  f (xf  (yz)  )=f  (f  (xy)  z) 
y.  f ( f ( xy)  z)  =>f  ( xf  { yz)  ) 

10.  c=f(ba)  theorem 

w substitution  from  the  axioms  was  performed  exhaustively  up 

to  level  2.  The  A-literal  identical  to  following  B-literal  chain 

deletion  rule  was  used.  The  number  of  chains  retained  with  and 

without  the  marking  restriction  is  given  in  the  table  below. 

Level  0 Level  1 Level  2 

without  marking  I 8 ~ "llu””” 

with  marking  1 8 84 

The  .1  ilevel  0 chain  is  the  theorem.  Tne  marking  restriction 

makes  no  difference  at  level  1 since  the  theorem  has  no  marks  to 

restrict  the  number  of  « substitution  operations  which  can  be 

performed.  Marking  only  has  an  effect  at  the  level  2 and  above. 

The  marking  restriction  resulted  in  a 15%  reduction  in  the  number 

of  chains  at  level  2. 

In  addition,  several  chains  at  level  2 were  « substituted  with 
and  without  the  marking  restriction.  The  following  table 
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expresses  the  number  of  chains  resulting  w substituting  each  of 
the  specified  level  2 chains. 


Ic=f (ba) ] 

(f (ec) “f (ba) ) 

f (f (xlxl)c)=f (ba) 

without 

33 

33 

lc=f (ba) ] 

(f  (ec)  = f (ba)  j 

1 f ( ec) =f ( 1 bf ( ae) ) 

20 

10 

Ic=f (ba) ) 

If (ec) =f (ba) 1 

! f (ec)  =f  ( f ( be)  a) 

20 

12 

lc=f  (ba)  1 

[f (ec)=f (ba) j 

!f  (ec)=f  (f  (ba)e) 

20 

12 

(c®f  (ba)  1 

[f (ec) =f (ba) 1 

f ( lef (ce) )=f (ba) 

20 

17 

lc=f(ba) ] 

[f (ec)=f (ba) 1 

f (f  (ee)c)*f  (ba) 

19 

19 

[c=f (ba) ] 

[f  (ec)*f  (ba)  ) 

'f  ('f  ( ec)  e)  =»f  (ba) 

20 

20 

(c=f (ba) ] 

If (ec)=f (ba) ) 

!f  (ec)=“f  ( !:bf  (ea)  ) 

20 

10 

Ic=f (ba) ] 

If (ec)=f (ba) j 

!f  (ec)=f  (f  (eb)a) 

20 

12 

(c»f  (ba)  ] 

If (ec) =f (ba) ] 

!f (ec) »f (ef (ba) ) 

20 

12 

The  marking  restriction  resulted  in  a 0%  (when  no  marks  were 
present)  to  50%  reduction  in  the  number  of  chains  with  a mean  of 
26%  reduction.  The  latter  figure,  which  substantially  exceeds 
the  15%  reduction  at  level  2,  confirms  our  expectation  that  the 
marking  restriction  becomes  more  effective  at  higher  levels  due 
to  the  increased  complexity  of  the  term  being  « substituted. 

The  remaining  high  branching  factor  for  the  * substitution 
operation  even  in  the  presence  of  the  marking  restriction 
indicates  that  although  it  may  substantially  reduce  the  number  of 
generated  chains,  it  is  not  a panacea  to  the  general  problem  of 
too  many  chains  being  generated. 

A reasonable  question  is  whether  the  marking  restriction  can 
be  extended  to  retention  of  marks  during  other  inference  (e.g.,  « 
reduction  by  an  axiom)  operations.  The  proposed  extension  would 
(where  applicable)  cause  a marked  term  to  remain  marked  in  every 
location  where  it  was  substituted.  (Presently,  the  only 
operations  which  preserve  marks  are  » substitution  from  an  axiom 
and  ir  substitution  from  a higher  negated  goal  from  which  the 
marked  literal  is  not  itself  derived  solely  by  ■»  substitution.) 


1 

s 

;i 
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The  following  example  demonstrates  the  incompleteness  of  tne 
proposed  extension  to  the  marking  restriction. 

1.  b>a 

2.  Cx  A Rx  ♦ Pxb 

3.  Rd 

4.  c=d 

5.  Oc 

6.  Pxa  theorem 

7.  [Pxa]  Plxb  « substitution  from  1 (x  is  marked) 

d.  [Pxaj  [Pxb]  Olx  R!x  w reduction  by  2 (x's  retain  marks 

according  to  proposed  extension) 

Si.  [Pda]  [Pdb]  Old  ir  reduction  by  3 

10.  [Pda]  [Pdb]  [Qd]  Qc  w reduction  by  4 

11.  0 n reduction  by  5 

The  reason  this  proof  is  invalid  is  that  chain  9 cannot  be 
formed  when  the  marking  restriction  is  applied  since  d of  Od  is 
mar  ked . 

There  are  alternate  proofs,  i.e.,  where  x is  « substituted  to 
c or  d before  a is  ir  substituted  to  b.  These  other  proofs  are 
also  blocked  since,  after  a is  « substituted  to  b,  the  value  of  x 
(c  or  d)  is  marked  yet  must  be  * substituted  again  to  handle  « 
reduction  by  both  Qc  and  Rd . 

We  will  next  explore  an  alternate  method  'for  controlling  this 
excessive  generation  of  chains:  demodulation,  which  is 
unfortunately  not  compatible  with  our  marking  restriction  since 
they  are  so  totally  different  in  philosophy  and  application. 

2.13.2.  Demodulation.  Demodulation  (developed  by  Wos  et  al 
[54])  offers  a partial  solution  to  the  explosive  growth  in  the 
number  of  partial  deductions  introduced  by  use  of  the  equality 
predicate . 

We  will  first  introduce  some  definitions. 
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A njodulation  list  is  a list  a2=b2,  .... 
aj^=bj^  oi  equality  literals. 

i^£iiDi£i2D  2j,ii.  If  E is  an  expression  containing  an  instance 
of  term  b,  and  E ' is  E with  a single  instance  of  b replaced  by  a, 

tnen  E'  is  an  imisaiite  EodylaQi  of  E relative  to  any  mooulation 
list  containing  a=b. 

2*  Expression  E'  is  a iSr2!22iJi3D£  expression 

E relative  to  modulation  list  D if  there  exists  a sequence  of 
immediate  modulants  Ej^,  Ej^  relative  to  D such  that  Ei=E, 

Ej,=E ' , and  n<k.  Note  that  E is  a x-modulant  of  E. 

fi§£ini2i2D  A demodulation  list  is  a modulation  list 

such  that  for  each  equality  literal  a=b  in  tne  list,  a has  fewer 
symbols  (is  simpler)  than  b. 

We  could  relax  the  restriction  on  a having  strictly  fewer 
symbols  than  b by  adopting  complexity  measures  similar  to  those 
used  in  complete  sets  of  reductions.  Using  appropriately  defined 
complexity  measures,  f(f{xy)2)  could  be  replaced  by  f{xf(y2))  and 
g(f(xy))  could  be  replaced  by  f (g(y)  g(x)  )..  If  complete  sets  of 
reductions  are  used,  we  are  also  assured  that  demodulation  is 
deterministic,  a desirable  property. 

We  could  also  allow  equalities  with  an  equal  number  of  symbols 
on  each  side  (such  as  the  commutativity  axiom  f(xy)»f(yx))  and 
redefine  demodulation  to  select  one  of  the  set  of  all  demodulants 
witn  minimal  complexity. 
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P®liDi£i2D  Expression  E*  is  a deaiociulant  of  expression 


£ relative  to  demodulation  list  D if  it  is  a k-niodulant  of  E 
relative  to  D for  some  k and  nas  no  immediate  modulant  relative 
to  D. 

In  a resolution  theorem  proving  system,  demodulation  is  used 
to  produce  the  demodulated  form  of  each  newly  derived  clause. 

The  demodulation  list  is  a list  of  equality  consequences  of  the 
set  of  axioms.  When  the  literals  of  two  clauses  which  are  being 
resolved  are  unified  against  each  other,  provision  must  be  made 
for  the  possibility  tnat  each  is  the  demodulated  form  of  more 
complex  literals  which  actually  must  be  unified.  Hence, 
k-modulants  of  each  of  the  literals  must  be  unified  wnere  k is 
generally  unbounded.  The  demodulated  literal  is  regarded  as 
representing  all  literals  from  which  it  could  be  produced  by 
demodulation. 

In  contrast  to  other  equality  restrictions  such  as  marking, 
the  use  of  demodulation  is  not  restricted  to  cases  in  which 
equality  inference  rules  are  being  used.  Demodulation  is  often 
used  to  simplify  terms  even  when  resolution  is  the  sole  inference 
system . 

There  are  several  reasons  -for  the  proposed  and  experimentally 
noted  success  of  demodulation: 

1.  demodulation  retains  terms  in  their  simplest,  most  likely  to 
be  useful  form, 

2.  reducing  the  variety  of  terms  by  demodulation  often  results  in 
extra  subsumption  operations  being  successful. 
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3.  reducing  the  variety  of  terms  by  demodulation  may  result  in 
extra  tautology  or  other  similar  deletion  operations  being 
successful,  and 

4.  demodulation  is  often  implemented  in  an  incomplete  system 
(terms  are  not  fully  k-modulated  back  upward  in  complexity  in 
matching)  resulting  in  -fewer  clauses  being  produced. 

The  great  success  of  demodulation,  often  permitting  the 
solution  of  problems  not  otherwise  solvable,  conceals  the  fourth 
reason  almost  totally. 

Demodulation  has  generally  been  presented  with  little  formal 
justification  (perhaps  because  though  it  is  not  obvious  that  it  ] 

works  it  is  at  least  intuitive).  However,  it  is  necessary  for  us  | 

to  justify  its  use  a little  more  -formally  tnan  usual  when 

applying  it  to  the  , procedure.  We  need  to  veri-fy  that  the  j 

higher  goal  complementary  to  lower  subgoal  deletion  rule  is  still  | 

I 

I 

valid  for  demodulated  deductions.  It  is  not  obvious  that  it  is 
since  it  is  likely  that  different  occurrences  of  the  same 
demodulant  really  refer  to  different  literals.  Eliminating 
deductions  with  complementary  demodulants  may  eliminate  all 
proofs . 

2iJ£2£S!S  Substitution  by  implied  equivalence  preserves 

consistency  for  ground  formulas.  If  S*-A*^  then  S{B*A}  (S  with  B 
uniformly  replaced  by  A)  is  consistent  (inconsistent)  if  S is 
consistent  (inconsistent)  where  S is  a ground  formula,  A and  B 
are  ground  atomic  formulas. 

Proof. 
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Case  1.  S is  consistent.  Then  S has  a model  M.  Since  S>-A*>3 
by  hypothesis,  M assi'jns  the  same  truth  value  to  A and  B.  Tne 
substitution  of  A tor  B in  S will  therclore  still  have  model  M. 
Therefore,  since  it  has  a model,  S(i3*-A)  is  consistent. 

Case  2.  S is  inconsistent.  Then  S by  uefinition  has  no 
model.  Hence,  it  also  has  no  model  which  assigns  the  same  truth 
value  to  A and  B,  and  the  substitution  of  A for  B in  S will  not 
result  in  a consistent  formula.  Therefore,  S{B*A}  is 
inconsistent.  G 

To  facilitate  the  proof  of  the  'following  tneorem,  we  will  make 
one  plausible  assumption  about  demodulation — that  demodulation  is 
deterministic  and  that  the  same  formula  will  not  be  demodulated 
in  two  different  ways  in  the  same  proof. 

2.8.  Demodulation  is  sound  and  complete  for  the  » 

procedure . 

£I22^-  Let  S be  an  inconsistent  set  of  clauses.  Let  C be  any 
essential  clause,  i.e.,  S-C  is  consistent.  Then  by  Herbrand's 
theorem  there  is  a minimally  inconsistent  set  of  ground  clauses 
S'  each  element  of  which  is  an  instance  of  a clause  of  S and  some 
clause  C contained  in  S ' is  an  instance  of  C. 

Let  Si-Ajsr^Sj^,  S'"A2**B2.  ...»  Then  let  Sj^  be 

S'IAj^»Bi},  SJ  be  Si{A2‘'-B2}r  ...»  be  {An-Enl  . Similarly, 

let  be  ( . . . ( (C  {Aj^-Bj^})  {A2*B2})  . . .)  {Ajj*Bj^)  . These  are  all 
atomic  formula  replacements  at  the  ground  level  effected  by  a 
demodulation  list. 

Then  by  the  previous  theorem  is  inconsistent  and  S^-Cp  is 
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consistent.  This  satisfies  the  ceauirements  for  application  of 


the  , procedure.  Hence,  there  is  a , proof  of  ->Cf^  from  S/,. 

This  , proof  can  be  altered  to  be  a » procedure  with 
demodulation  proof  of  -•€ ' from  S'  by  incorporating  the 
....  equivalences  in  the  literal  matching  process.  The 

resulting  proof  does  not  violate  the  higher  goal  complementary  to 
lower  goal  subgoal  restriction  (if  the  proof  of  does  not) 
since  (by  the  assumption  of  demodulation  being  deterministic) 
identical  goals  in  the  proof  of  -iC ' must  correspond  to  identical 
goals  in  the  proof  of  ”«C^;  when  transforming  the  proof  from  one 
proving  -*C^  to  one  proving  -•C  * , the  atomic  formulas  in  the  proof 
can  only  get  more  distinct — no  new  identities  are  produced. 

The  proof  of  -iC  ' from  S'  can  easily  be  lifted  to  a pro:  f of  -»C 
from  S by  the  « procedure  with  demodulation. 

The  » procedure  with  demodulation  is  complete  by  the 
demonstration  of  the  existence  of  this  proof.  It  is  sound  since 
substitution  by  implied  equivalences  (of  which  demodulation  is  an 
example)  is  consistency  preserving,  and  by  the  soundness  of  the  » 
procedure  applied  to  S^ . U 

2.14.  Use  of  models 

The  use  of  models  should  be  an  important  component  of 
deductive  systems.  They  can  substantially  reduce  computational 
effort  by  indicating  which  goals  cannot  possibly  be  proved. 

Since  pairs  of  individually  complete  theorem  proving 
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refinements  are  not  necessarily  complete  when  used  in 
combination,  and  it  is  well  known  that  the  combined  semantic  and 
linear  resolution  refinements  are  incomplete,  it  is  worth 
considering  whether  there  is  a usage  of  models  which  is  complete 
when  used  with  the  » or  MESON  procedures. 

VJe  will  start  by  describing  the  most  successful  use  of  models 
in  deduction  to  date,  the  Geometry-Theorem  Proving  Machine. 

Then,  after  giving  a brief  taxonomy  of  models,  we  will  indicate 
why  direct  application  of  Geometry-Theorem  Proving  Machine  model 
ideas  will  not  work  for  the  « or  MESON  procedures. 

It  will  be  shown  that  there  are  circumstances  in  which  it  is 
necessary  to  attempt  to  prove  subgoals  in  the  » or  MESON 
procedures  which  are  not  only  false  in  a model,  but  which  are 
actually  disprovable  from  the  axioms.  We  will  then  present  a 
possible  solution  in  the  form  of  a modified  ir  or  MESON  procedure. 

2.14.1.  Geometry-Theorem  Proving  Machine.  Gelernter  et  al . ' s 
Geometry-Theorem  Proving  Machine  (GTM)  {13,14,15,16]  is  a theorem 
proving  program  for  Euclidean  plane  geometry.  The  deductive 
portion  of  GTM  functions  entirely  in  the  (decidable,  see  Tarski 
(501)  domain  of  Euclidean  plane  geometry  which  excludes 
constructions.  An  additional  facility  in  GTM  permits 
constructions  when  GTM  cannot  find  a solution  without  one. 

GTM  is  a pure  problem  reduction  method  based  theorem  proving 
program.  It  is  presented  a list  of  hypotheses  and  a conclusion 
expressed  in  terms  of  geometric  predicates  and  points.  The 
latter  are,  in  a more  theoretical  sense,  .Skolero  constants.  GTM 
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then  works  backward  'from  the  conclusion  to  the  hypotheses  and 
known  identities  using  theorems  of  geometry.  The  primary 
heuristic  in  GTH  is  the  presence  of  a model  to  reject  false 
goals.  The  model  takes  the  form  of  a geometric  diagram  from 
which  measurements  are  taken  and  is  constructed  so  as  to  satisfy 
the  premises  of  the  theorem  to  be  proved. 

The  notation  employed  by  Gelernter  (13]  in  describing  the 
operation  of  GTM  is  somewhat  different  from  that  which  is 
currently  used  to  describe  problem  reduction  method  systems.  In 
his  terminology,  the  application  of  a theorem  to  a subgoal 
generates  a single  subgoal.  This  subgoal,  however,  may  be  a 
conjunctive  set  of  terras  none  of  which  separately  is  considered 
to  be  a subgoal..  Solution  of  each  term  constitutes  solution  of 
the  subgoal.  The  following  description  of  GTM  will  use  a more 
contemporary  representation  for  subgoals  and  dismiss  this 
notational  distinction  between  terms  and  subgoals. 

.In  GTM,  immediate  subgoals  of  the  current  goal  are  generated 
in  breadth  first  fashion.  More  precisely,  in  order  to  solve  a 
single  subgoal,  all  applicable  theorems  are  attempted  in 
parallel.  Each  successful  application  of  a theorem  (the 
consequent  of  the  theorem  matches  the  subgoal)  results  in  the 
formation  of  a conjunctive  set  of  subgoals  whose  successful 
simultaneous  solution  constitutes  a solution  for  the  original 
subgoal . 

GTM  does  not  allow  unbound  variables  in  its  problem  solving 
graph.  Thus,  the  next  step  is  to  replace  each  set  of  conjunctive 
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subgoals  having  free  variables  by  all  alternative  conjunctive 
sets  with  variables  replaced  by  points  specified  in  the  proolem 
statement . 

Subgoals  are  then  subjected  to  a test  to  determine  whether 
they  are  true  in  the  diagram.  Sets  of  subgoals  containing 
subgoals  false  in  the  diagram  are  rejected.  (Actually,  the  two 
steps  of  (1)  instantiating  variables  by  points  in  all  possible 
ways  and  (2)  rejecting  conjunctive  sets  of  subgoals  containing  a 
false  subgoal  can  be  performed  partially  as  one  step  with  the 
analysis  of  the  diagram  actually  proposing  points  rather  than 
taking  all  instantiations  of  the  subgoals  and  testing  each  one.) 

Generated  sets  of  subgoals  are  then  subjected  to  several  tests 
to  determine  if  they  should  be  added  to  the  subgoal  tree 
("problem  solving  graph").  These  rules  ate  somewhat  heuristic  in 
nature  and  could  be  improved  by  the  use  of  model  elimination  or 
MESON  procedure  acceptability  and  subsumption  rules. 

.If  B is  a subgoal  of  A and  has  no  partner  goals,  no  other 
conjunctive  set  of  goals  containing  .either  B or  a symmetric 
variant  of  B will  be  added  to  the  problem  solving  graph  as 
subgoals  of  A.  The  solution  of  the  original  B in  the  problem 
solving  graph  is  implied  by  the  solution  of  any  of  the  other  sets 
of  subgoals  containing  B and  is  alone  sufficient  to  solve  A. 

This  is  really  just  a subsumption  condition. 

No  conjunctive  set  of  subgoals  will  be  added  to  the  problem 
solving  graph  if  this  would  result  in  a subgoal  being  identical 
to  or  a symmetric  variant  of  an  ancestor  goal.  This  is 
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essentially  equivalent  to  the  model  elimination  ar  MESON  ' 

procedure  statement  that  chains  with  an  A-literal  or  niqher  goal 
followed  by  an  identical  B-literal  or  lower  goal  can  be  discarded 
without  loss  of  completeness. 

A third  rule  eliminates  the  addition  of  a subgoal  without 
partner  goals  that  is  identical  to  or  a symmetric  variant  of  a 
subgoal  without  partner  goals  that  is  a subgoal  of  an  ancestor  of 
the  proposed  added  goal.  .In  linear  MESON  procedure  terms,  the 
restriction  is  that  if  chain  lA]  B is  part  of  the  problem  solving 
graph,  chains  [A]  [C^]  ...  [Cp]  B or  [A]  (Cil  ....  [Cn]  B'  will 

not  be  added  (where  B*  is  a symmetric  variant  of  B)  . This  is, 
once  again,  a subsumption  rule.  .It  would  be  a valid  inference 
rule  in  model  elimination  or  the  MESON  procedure  in  the  absence 
of  reduction  (which  is  not  present  in  GTM)  since  the  presence  or 
absence  of  A-literals  is  then  irrelevant  to  finding  a solution. 

The  final  rule  specifies  the  rejection  in  some  cases  of 
conjunctive  sets  of  subgoals  that  are  supersets  of  other 
conjunctive  sets  of  subgoals.  .If  the  larger  is  descended  from  a 
member  of  the  smaller,  the  larger  is  rejected;  if  the  smaller  is 
descended  from  the  larger,  both  are  retained.  This  part  of  the 
rule  is  again  similar  to  the  A-literal  identical  B-literal  rule 
in  model  elimination  or  the  MESON  procedure.  .If  one  is  not 
descended  from  the  other,  it  is  retained  only  if  it  has  fewer 
elements.  This  then  is  essentially  another  subsumption  rule. 

A conjunctive  set  of  subaoals  is  rejected  if  any  of  its  i 

1 

elements  is  labeled  as  stuck.  Subgoals  become  stuck  by 
exhausting  all  alternative  proofs. 
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If  the  subgoal  is  acceptable  according  to  the  above  criteria, 
it  is  then  checked  to  see  if  it  is  laueled  os  already  established 
or  (in  certain  cases,  notatably  identities)  if  there  exists  a 
short  (one  step)  solution. 

■The  search  strategy  dictates  that  solution  of  a conjunctive 
set  of  subgoals  will  cause  GTM  to  start  working  on  the  next 
unsolved  ancestor  partner  goal. 

When  forced  to  choose  a subgoal  to  work  on,  candidates  are 
compared  to  the  hypotheses  with  respect  to  points,  connectives, 
and  predicates  and  some  distance  function  computed.  The  subgoal 
distance  score  is  then  expanded.  (Note:  this  evaluation  function 
differs  from  the  admissable  measures  of  Nilsson  (35]  as  mentioned 
with  relation  to  the  order  functions  of  PSTP  since  it  considers 
only  the  remaining  cost  to  reach  the  goal,  not  the  total  cost.) 

The  two  features  of  GTM  which  vastly  improve  its  performance 
relative  to  a more  naive  geometry  theorem  proving  machine  using 
only  the  theorems  are:  special  handling  of  certain  concepts 
(symmetry,  recognition  of  identity  of  alternately  named  angles) 
and  the  use  of  the  diagram  to  reject  false  subqoals. 

2.14.2.  Abstraction  of  Geometry-Theorem  Proving  Machine. 

Reiter  [38,39]  has  tried  to  generalize  the  notion  of  models  in 
fomal  deductive  systems.  Using  GTM  as  an  example,  ne  develops  a 
formal  system  resembling  Bledsoe  et  al . ' s [5,8]  IMPLY  which 
prunes  its  search  tree  by  evaluating  subgoals  in  a model. 

The  extension  of  a formal  system  like  Bledsoe's  IMPLY  to 
incorporate  a model  is  quite  straightforward.  It  is  only 
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necessary  to  determine  if  some  instance  of  generated  subgoals  is 
true  in  the  model.  If  all  instances  are  false,  the  subgoal  is 
rejected . 

A formal  justification  for  the  use  of  models  to  reject 
deductions  containing  false  subgoals  in  the  problem  reduction 
method  takes  the  form  of  the  following  proof  of  the  compatibility 
of  the  use  of  models  and  input  resolution.  Input  resolution  is, 
of  course,  isomorphic  to  the  problem  reduction  method,  being 
transformed  into  it  by  the  negation  of  literals  in  the  deduction. 

{Henschen  and  Wos  (20])  If  S is  a minimally 
unsatisf iable  set  of  Horn  clauses,  then  S contains  exactly  one 
negative  clause. 

Jil§2£§S  2^1,2*  S be  an  unsati sfiable  set  of  ground 

clauses.  Let  M be  a model  for  the  non-negative  clauses  of  S 
expanded  to  the  language  of  S.  Then  there  exists  an  input 
refutation  of  S such  that  the  negative  top  clause  and  every 
resolvent  is  false  in  M. 

^£22^*  Proof  by  induction  on  the  number  of  distinct  atoms  n 
in  S. 

n»l.  If  n=l  then  S = {P,  -iP ) for  some  atom  P.  The  resolution 
of  ~tP  and  P to  derive  the  empty  clause  Q is  an  input  refutation 
of  S -with  negative  top  clause  ->P  and  single  resolvent  u both 
■false  in  M.  Therefore,  the  theorem  is  true  for  this  case. 

n>l.  Let  P be  (the  atom  of)  a positive  unit  clause  in  S. 

There  must  exist  such  a clause  since  the  lack  of  a positive 
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clause  contradicts  the  unsatisfiability  of  S,  and  no  positive 

Horn  clause  may  have  mote  than  one  literal  by  definition. 

Let  be  S with  (1)  every  clause  containing  P removed  (if  S 

is  minimally  inconsistent,  only  the  clause  P will  be  removed), 

and  (2)  every  instance  of  -iP  deleted.  is  an  unsati sf iable 

Horn  set  with  fewer  than  n distinct  atoms.  Also,  M is  a model 

for  the  non-negative  clauses  of  Therefore,  by  the  induction 

hypothesis,  there  exists  an  input  refutation  of  Si  such  that  the 

; *** 

J 

negative  top  clause  and  every  resolvent  is  false  in  M. 

Let  D be  the  input  refutation  of  Sj^  such  that  the  negative  top 
clause  and  every  resolvent  is  false  in  M.  Add  -iP  to  the  top 
clause  if  the  clause  in  S /rom  which  it  was  derived  contained  -•P. 
Add  ->P  to  each  resolvent  if  the  parent  in  was  obtained  from  a 
clause  in  S containing  -P,  or  if  its  near  parent  has  had  -^P 
added.  If  the  resulting  derivation  has  -•P  as  the  result  rather 
than  the  empty  clause  Q as  before,  resolve  it  away  with  P to 
obtain  u.  The  result  is  an  input  refutation  of  S with  the 
negative  top  clause  and  every  resolvent  false  in  H.  Q 

This  is  a vitally  important  instance  of  the  more  general 
theorem  by  Henschen  [18]  relating  the  use  of  models  to  deduction 
with  Horn  sets.  His  theorem  does  not  restrict  the  model  to 
falsifying  the  negative  top  clause  of  the  deduction. 

Reiter  brings  up  some  important  issues  relating  to  the  use  of 
models  in  first  order  predicate  calculus  theorem  proving  systems. 
In  Reiter's  system  (and  nearly  all  others  that  attempt  to  work 
with  the  first  order  predicate  calculus)  only  quantifier  free 
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formulas  are  employed,  Skolem  functions  being  previously 
introduced.  Thus,  for  example,  in  a formulation  of  a GTM  proolem 
in  Reiter's  system,  Skolem  constants  would  be  introduced  to 
designate  points  in  the  premises  and  conclusion.  The 
interpretation  of  the  resulting  formula  is  that  it  is  to  be 
proved  true  for  every  interpretation  of  the  Skolem  constants. 

The  difficulty  is  that  while  the  .formula  is  to  be  proved  true  for 
every  interpretation  of  the  Skolem  constants,  the  standard  notion 
of  a model  allows  a constant  to  represent  only  one  element  of  the 
domain.  An  approach  to  solving  this  problem  is,  instead  of 
employing  a single  model  to  test  for  rejection  of  subgoals, 
several  "parallel  models"  can  be  employed,  each  representing  the 
assignment  of  Skolem  constants  to  points  in  the  diagram  which 
satisfies  the  premises  This  is  really  not  an  important  issue  in 
the  geometry  theorem  proving  example  since  there  is  likely  to  be 
only  one  non-redundant  interpretation  of  Skolem  constants  as 
points  in  the  diagram  which  satisfies  the  premises.  An  example 
which  illustrates  the  utility  of  the  parallel  models  approach  is 
the  group  theory  problem  of  proving  that  a group  in  which  the 
square  of  every  element  is  the  identity  element  is  Abelian  (the 
GR0UP2  problem  in  Appendix  A).  Here,  our  model  can  be  an  example 
of  a group  in  which  the  square  of  each  element  is  the  identity 
element.  The  restriction  that  Skolem  constants  a and  b from  the 
statement  of  the  theorem  a*b»b*a  be  associated  with  single 
elements  of  the  domain  creates  a rather  weak  model.  .In  this 
example,  we  can  generate  several  models  each  representing  a 
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different  interpretation  of  a and  b as  elements  of  the  group  and 
examine  each  suogoai  in  ail  these  models. 

Similar  parallel  models  can  be  set  up  to  handle  different 
interpretations  of  non  0-ary  Skolem  functions.  Suppose  f is  a 
unary  Skolem  function.  Then  to  decide  whether  to  reject  subgoal 
Pxf(x),  is  is  necessary  to  determine  if  for  every  parallel  model 
there  exists  an  assignment  to  x for  which  Pxf(x)  is  true.  If 
not,  the  subgoal  is  rejected.  .Unfortunately,  this  approach 
yields  a weaker  model  than  necessary.  Performing  the  above  check 
is  equivalent  to  check  VyBxPxy  whereas  the  expression  3xVyPxy  is 
a valid  expression  to  check  (since  this  is  the  formula  from  which 
Pxf(x)  is  derived  by  Skolemization) . This  gambit  of  testing  the 
model  satisfiability  of  an  expression  by  analyzing  the  intent  of 
the  Skolem  function  is  not  always  successful.  The  formula 
Pf(x)f(y)  (obtainable  by  resolution  for  example)  is  an  example  of 
a formula  uninterpretable  in  this  way  since  it  is  not  the  product 
of  Skolemizing  any  formula  (where  f is  again  assumed  to  be  a 
Skolem  function)  . This  difficulty  with  formally  describing  the 
strongest  possible  model  to  use  in  the  deductive  system  remains 
unresolved . 

These  previous  ideas  can  readily  be  extended  to  infinite 
models  by  reducing  the  latter  to  finite  models  by  choosing  a 
finite  set  of  representatives  of  the  infinite  model.  For 
example,  in  number  theory  theorem  proving,  a model  can  be  created 
by  coding  arithmetic  operations  etc.  and  supplying  a 

small,  finite  subset  of  the  natural  numbers  (e.g.,  0,  1,  2,  5, 

17,  24). 
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2.14.3.  Types  of  models.  Models  can  be  evaluated  in  teems  of 
two  criteria:  accu£acy  and  scope. 

The  issue  of  accuracy  of  a model  is  best  considered  in  the 
case  of  a complete  theory  and  is  concerned  with  whether  the  model 
ever  falsely  assigns  a value  to  an  expression. 

Two  entirely  different  effects  may  occur  if  the  model  being 
used  is  inaccurate.  The  first  possibility  is  that  the  model  will 
assign  value  true  (i.e.,  provable,  proceed)  to  a goal  which  is 
false.  The  sf'cond  possibility  is  that  the  model  will  assign 
value  false  (i.e.,  disprovable,  abort  this  proof)  to  a goal  which 
is  true.  These  two  possibilities  are  similar  to  type  I and  type 
II  errors  in  statistical  inference  respectively,  where  a type  I 
error  accepts  a false  non-null  hypothesis  and  a type  II  error 
rejects  a true  non-null  hypothesis.  We  will  call  these  two 
possibilities  type  I and  type  II  errors  following  the  statistics 
convention. 

In  some  circumstances,  the  commission  of  a type  I error, 
though  by  retaining  and  attempting  to  prove  a false  goal  it 
wastes  computational  effort,  is  harmless  from  the  standpoint  of 
completeness.  This  is  the  case  in  GTM.  Here,  if  the  model 
assigns  value  true  to  expressions  which  are  false  (by  for  example 
having  in  the  diagram  two  angles  which  are  equal  within  the 
tolerances  of  measurement  which  are  not  necessarily  equal  given 
the  statement  of  the  problem) , only  computational  effort  is 
wasted  in  possibly  attempting  to  prove  this  false  equality  and 
completeness  is  not  sacrificed. 
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The  commission  of  a type  II  error  is  generally  more  serious 
since  the  response  to  the  evaluation  of  an  expression  in  the 
model  to  false  results  in  the  elimination  of  a partial  deduction 
rather  than  the  retention  of  one.  It  is  possible  in  these 
circumstances  to  eliminate  all  valid  proofs  using  a faulty  model. 
Type  .II  errors  are  avoided  in  GTM  by  correct  construction  of  the 
diagram  and  using  tolerances  in  measurement  in  determining  wnat 
is  true  in  the  diagram  to  avoid  false  inference  on  the  basis  of 
measurement  error. 

In  statistical  inference,  there  is  a symmetry  between  type  I 
and  type  II  errors  and  the  null  and  non-null  hypotheses. 
Committing  a type  .1  error  is  equivalent  to  committing  a type  II 
error  with  respect  to  the  null  hypothesis  by  rejecting  it. 
Committing  a type  .II  error  is  equivalent  to  committing  a type  I 
error  with  respect  to  the  null  hypothesis  by  accepting  it.  This 
symmetry  persists  in  the  analogous  case  of  models.  If  the  model 
incorrectly  evaluates  an  expression  in  the  theory,  either  a type 
I or  type  II  error  will  be  committed  when  considering  that 
expression  as  a goal  and  a type  II  or  type  I error  respectively 
will  be  committed  when  the  negation  of  the  expression  is 
considered  as  a goal. 

Given  this  symmetry,  and  the  fact  that  type  II  errors  are 
generally  considered  to  be  intolerable  since  they  may  cut  off  the 
only  path  to  a proof,  how  is  it  that  GTM  can  tolerate  even  type  .1 
errors?  The  answer  is  that  GTM  never  attempts  to  prove  the 
negation  of  a goal.  Thus,  a type  .1  error  for  a goal  will  never 
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result  in  a type  II  error  for  its  negation  because  its  negation 
will  never  occur. 


i 


I 


Given  this  symmetry  indicating  that  a type  I error  for  an 
expression  results  in  a type  II  error  for  its  negation  and  vice 
versa,  and  the  intolerability  of  type  .II  errors,  we  must  be 
concerned  only  with  accurate  models  which  incorrectly  evaluate  no 
expressions  if  we  are  concerned  with  a procedure  which  will  be 
asked  to  prove  expressions  and  their  negations. 

The  scope  of  a model  refers  to  the  extent  to  which  it  defines 
values  for  expressions  which  actually  are  provable  in  the  theory. 
In  a theory,  there  are  three  possibilities  for  any  expression: 
the  expression  is  provable,  the  expression  is  disprovable  (the 
negation  of  the  expression  is  provable) , and  neither  the 
expression  nor  the  negation  of  the  expression  is  provable.  In 
the  first  case,  the  expression  will  be  termed  to  be  true;  in  the 
second  case,  the  expression  will  be  termed  to  be  false;  in  the 
third  case,  the  expression  will  be  termed  to  be  contingent  since 
its  value  is  contingent  on  information  outside  the  theory. 

A model  which  assigns  values  to  all  the  non-contingent  and 
some  of  the  contingent  expressions  of  the  theory  will  be  called 
an  exarople  since  it  is  a model  for  an  instance  of  the  object 
described  in  the  theory. 

A model  which  assigns  values  to  only  some  of  the 
non-contingent  and  none  of  the  contingent  expressions  of  the 
theory  will  be  called  an  abstraction  since  it  is  a model  for  a 
subtheory  of  the  object  described  in  the  theory. 
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A model  may  fit  neither  category,  assigning  values  to  some 
contingent  expressions  and  failing  to  assign  values  to  some 
non-contingent  expressions.  Such  a model  will  be  called  mixed. 

In  our  future  discussion,  we  will  allow  any  of  these  types  of 
models  (examples,  abstractions,  mixed  models).  However,  since  we 
are  concerned  with  problem  solving  procedures  which  deal  witn 
both  goals  and  their  negations,  we  must  restrict  ourselves, 
following  the  reasoning  above,  to  accurate  models  to  eliminate 
any  possibility  of  eliminating  all  solutions  by  false  inference 
from  models. 

The  one  sort  of  model  we  need  not  consider  is  that  wnich 
assigns  correct  values  to  all  non-contingent  expressions  and  no 
values  to  contingent  expressions.  This  model  assigns  values  true 
or  false  to  precisely  those  expressions  which  are  provable  or 
disprovable  respectively.  Since  this  is  so,  the  model  has 
exactly  the  same  deductive  power  as  the  problem  solving  system, 
and  there  would  be  no  need  for  the  latter. 

2.14.4.  .Incompatibility  of  « procedure  with  use  of  models.  We 
could  hypothesize  that  the  t procedure  is  compatible  with  the  use 
of  models — that  any  deduction  containing  a false  goal  could  be 
eliminated.  It  is  readily  apparent,  however,  that  even  refutable 
goals  must  sometimes  be  considered  in  the  v procedure.  Tne 
simplest  example  of  proof  by  contradiction  requiring  the 
reduction  operation  (matching  a goal  to  the  negation  of  a higher 
goal)  demonstrates  this. 

Example 
1.  -lA  V c 
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2.  A V C 

3.  C 

4.  ICJ  A 

5.  (Cl  lA]  •< 
b . 0 


theorem 

extend  3 by  1(a) 
extend  4 by  2(b) 
reduce  5 (a >c) 


Note  that  the  proof  requires  the  presence  of  the  goal  -»C , a 
refutable  goal  (indeed,  the  negation  of  the  goal  which  we  set  out 
to  prove)  . The  alternative  proof  which  extends  first  by  clause 
2,  then  by  clause  1 resulting  in  intermediate  goal  -»A  rather  than 
A suffers  equally  from  this  difficulty. 

This  proof  precludes  the  possibility  of  a model  refinement  of 
the  « procedure  which  eliminates  all  proofs  containing  a 
refutable  literal..  We  must  then  refine  our  definition  of  model 
unacceptable  chains  to  allow  chains  whose  literals  which  are 
false  in  the  model  are  to  be  removed  by  reduction. 

We  can  then  hypothesize  the  completeness  of  use  of  models  with 
the  1 procedure  if  rejection  of  chains  is  eliminated  where  the 
cause  of  rejection  is  the  -falsity  in  the  model  of  subgoals  to  be 
reduced.  The  following  example  with  both  possible  proofs 
illustrates  the  incompleteness  of  this  procedure. 

Example 


1.  -A 

V 

c 

2.  -.B 

V 

c 

3.  A 

V B 

4.  C 

theorem 

5.1. 

(CJ 

A 

extend 

4 by  1(b) 

6.1. 

IC] 

(A1 

-.B 

extend 

5.1  by  3(a) 

7.1. 

IC] 

(A1 

(-*B  1 

-iC 

ex  tend 

6.1  by  2(a) 

8.1. 

0 

reduce 

7.1(a,d) 

5.  2. 

(Cl 

B 

ex  tend 

4 by  2(b) 

6.2. 

(Cl 

(B1 

-.A 

extend 

5.2  by  3(b) 

7.  2. 

(Cl 

(B1 

(-A1 

■< 

extend 

6.2  by  1(a) 

8.  2. 

0 

r ed  uc  e 

7.2  (a,d) 
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The  set  of  clauses  {-•AvC,  -•BvC,  AvB}  has  3 models: 

{A,a,C},  {-lAfBfC},  and  {A,-<B,C). 

The  first  of  the  two  proofs  of  C from  these  clauses  contains 
no  goals  not  in  the  model  except  for  ->C  removed  by  reduction  for 
the  third  model  only.  The  second  of  the  two  proofs  of  C from 
these  clauses  contains  no  goals  not  in  the  model  except  for  -»C 
removed  by  reduction  for  the  second  model  only.  Note  that 

neither  proof  is  acceptable  to  the  first  model,  and  thus  the  v | 

I 

i 

procedure  is  incompatible  with  the  model  refinement  for  arbitrary 
choice  of  models. 

We  can  even  modify  the  last  example  to  illustrate  that  the  * 
procedure  must  sometimes  even  expand  subgoals  which  are 
disprovable  (false  in  any  example  model). 

Example  2^21. 

1.  -A  V c 

2.  -B  V c 

3.  A V B 

4.  -iC  V A 

5.  -iC  V B 

From  the  first  clause  we  have  that  if  A is  true  then  C is 

true.  From  the  second  clause  we  have  that  if  B is  true  then  C is 

true.  The  third  clause  states  that  A or  B is  true;  therefore,  C 
must  be  true  (by  the  first  and  second  clauses) . By  the  third 

i 

clause  we  have  that  if  C is  true  then  A is  true.  By  the  fourth  j 

clause  we  have  that  if  C is  true  then  B is  true.  Since  C is  true 

by  the  reasoning  above,  both  A and  B must  also  be  true.  Thus,  A, 

B,  and  C are  all  provable  -from  this  set  of  clauses  and  the  only 
complete  model  is  lA,B,C}. 
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The  proposed  model  refinement  states  that  partial  deductions 
containing  refutable  goals  not  removable  due  to  their  being  the 
negation  of  a higher  goal  are  eliminable.  Thus,  the  model 
refinement  in  this  instance  would  permit  the  elimination  of  any 
partial  deduction  containing  -*a,  -»D,  or  •'C  except  as  the  negation 
of  a higher  goal. 

Following  is  a complete  set  of  valid  proofs  of  the  C from  the 
set  of  axioms. 


6.  C 

7.  (Cl  A 

8.  (Cl  (Al  -B 

9.  (Cl  (Al  (-B1  -tC 

10.  0 

6.  C 

7.  (Cl  B 

a.  (Cl  (Bl  -A 

9.  (Cl  (Bl  (-A1  -C 

10.  0 


theorem 

extend  6 by  1(b) 
extend  7 by  3 (a) 
extend  8 by  2(a) 
reduce  9 (a,d) 

theorem 

extend  6 by  2(b) 
extend  7 by  3(b) 
extend  8 by  1(a) 
reduce  9 (a,d) 


The  appearance  of  -iC  in  each  of  these  proofs  is  acceptable  to 
the  model  refinement  since  in  each  case  it  is  removed  by 
reduction  and  is  thus  a negated  higher  goal.  -However,  neither 
proof  is  totally  acceptable  .to  the  model  refinement  i-f  A,  B,  and 
C are  all  in  the  model  (as  they  validly  may  be  since  all  are 
provable)  since  one  proof  contains  -•A  and  the  other  contains  ->B. 

2.14.5.  A model  refinement  'for  the  « procedure.  .If  the  v 
procedure  restriction  on  eliminating  partial  deductions  in  which 
a goal  is  identical  to  a higher  goal,  and  thus  permitting  a goal 
to  be  a subgoal  of  itself,  is  eliminated,  the  model  refinement 
does  accept  a proof  to  the  last  example. 

2.^28. 

1.  V c 

2.  "B  V c 
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3. 

A V B 

4. 

-iC  V A 

5. 

-C  V B 

6. 

C 

theorem 

7. 

IC]  A 

extend  6 

by 

1(b) 

3. 

[C]  [A]  C 

extend  7 

by 

4(b) 

9. 

[C]  [A1  [Cl  B 

extend  8 

by 

2(b) 

10 

. [C]  [A]  |C1  [3]  -A 

extend  9 

by 

3(b) 

11 

. 0 

reduce  10 (b. 

e) 

There  is,  by  the  symmetry  of  the  set  of  axioms,  a similar 
proof  in  v;hich  B rather  than  A is  the  first  subgoal  of  C. 

However,  even  this  refinement  does  not  permit  a deduction  to 
the  earlier  example  of  proving  C from  {tA  v c,  v c,  A v b} 
with  the  example  model  {A,B,C}.  If  we  further  extend  the  idea  of 
allowing  repetition  of  goals,  even  when  the  repetition  cannot  be 
formed  by  use  of  the  axioms,  a deduction  for  this  example  can  be 
found. 


Example  2. 29. 

1.  -.A  V c 

2.  -.8  V c 

3.  A V B 

4.  C 

5.  (C]  A 

6.  (C]  [A]  C 

7.  [C]  [A]  (C]  B 

8.  [C]  [A]  (Cl  (B]  -A 

9.  0 


theorem 

extend  4 by  1(b) 
repeat  higher  goal  C 
extend  6 by  2(b) 
extend  7 by  3(b) 
reduce  3 (b,e) 


This  leads  to  our  final  hypothesis  of  a complete  combination 
of  use  of  models  and  the  « procedure;  any  chain  can  be 
eliminated  if  any  A-literal  (literal  extended  upon)  is  false  in 
the  model,  provided  identical  A-literals  may  be  duplicated.  We 
will  call  this  procedure  the  M*  procedure  for  w procedure  with 
model . 

We  can  express  the  procedure  in  terms  of  a single  inference 
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rule  like  the  w procedure  except  that  all  higher  subgoals  are 
free  to  be  extended  upon,  not  just  the  lowest  goal.  Thus,  we  can 
consider  trying  to  prove  at  each  stage  the  disjunction  of  the 
lowest  goal  and  all  its  higher  goals.  Following  is  a definition 
of  the  Ml  procedure  in  terms  of  its  single  intorcncc  operation. 

, Definition  ^^^2.  Bi Bp  v ...  v if 

3,^  is  Ai  A ...  A « Cj  (m>0) 

Bi,...,ap,~Ci -Cq.-Ai  v Ci  v ...  y Cg 

and  B^, . . . ,Bp,~C2,  . . . ,'‘Cq,~Aj^  *‘Mi^i  ''  ^ . . . v Cq 

®1 Bp,~Ci,...,-Cq,~An,  v Ci  v ...  v Cq  for  some  k,j 

(l<k£p,l£j<q)  . 

•If  myo  then  Cj  must  not  be  false  in  the  model. 

J51S2ESS  The  Mi  procedure  is  sound,  i..e.,  if  BjL,...,Bp 

^ ...  ^ Cq  then  B^,...,Bp  *“  Cj  ^ ...  Cq  whe r e *” 
denotes  derivability  by  the  Mi  procedure. 

B£22l»  We  will  perform  induction  on  the  number  of 
applications  n of  the  sing-ie  Mi  iivference  rule  used  in  a Mi 

procedure  proof. 

/ 

n»l.  Assume  B]^,...,Bp  v ...  ''  Cg  with  only  one 

application  of  the  inference  rule  used  in  the  proof.  Since  only 
one  application  of  the  inference  rule  is  required,  B|^  must  be 
some  Cj  (l_<j£q)  . Otherwise,  additional  applications  would  be 
required  to  prove  Aj^,  ...,  and  A^,.  Clearly 
Bj, . . . ,B|j_]^  ,Cj  ,3|^+i  ,Bp  Cj^  ...  ''  Cq.  Therefore,  the  Mi 
procedure  is  sound  >for  this  case. 
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n>l.  Assume  the  M*  procedure  is  sound  tor  all  prooLs  with 
■ fewer  than  n applications  of  the  inference  rule,  erove  it  is 
sound  for  n applications.  Assume  Ci  v ...  v c 

using  n applications  of  the  inference  rule. 

The  proof  of  Cj^,...,Cq  employs  the  subptoofs 

*^1 3p,~Ci,...,~Cq,~Ai  -m,Ai  V Cl  V ...  v Cq . 

Bi,  . . . ,ap,~Ci,  . . . ,~Cq,-Ai  "-MiAi  ''  Ci  ...  ''  Cq  and 

^1 Bp, -Cl -Cq.-A^  -„,A„  V Cl  V ...  V Cq 

where  is  Ai  ^ ...  a Ajj,  ♦ Cj  for  some  k,j  (l^k£p,l^j£q)  . By 
the  induction  hypothesis,  since  each  of  these  subproofs  has  fewer 
than  n applications  of  the  inference  rule. 


^ 2^  # • • • f Bp  f ^ 

C 2 r • • • 

,~Cq,-Ai 

- Ai  V 

Cl 

... 

^ Cq^ 

3 ^ f • • • r Bp  t 

C 2 r • • » 

• »"*Ai 

•-  Ai  V 

Cl 

V ... 

''  Cq^ 

B ^ r • • • r ^ 

C 2 r • • • 

f “"Cq  ,-Aj5j 

A V 

Cl 

''  ... 

" Cq. 

Thus,  by  the  deduction  theorem. 

B 2 # • • • # 3p  9 

►-  Ai  V 

Cl  V . . . 

v Cq, 

3 2 r • • • f Bp  9 

H Ai  V 

Cl  V . . . 

''  Cq' 

• • • 

, and 

B 2 / • • • f Bp  9 

Cl  ''  . . . 

" Cq. 

Therefore , 

Bi , . . . 

,Bp,  •-  (Ai  a . . . 

A 

Am)  " 

Cl  V 

since  B|^  is 

Ai  A 

. . . A Aji, 

-»  Cj  we 

also  have  Bi, 

A ...  A A^) 

V c ■ 

Therefore,  Bi, 

% • • 

.Bp  - 

l-«(Al 

V Cjl  A ((Ai  A .. 

. A A„)  V 

Cl  V . 

• • 

V c 1 

and , 

• ^ ^mi 


Bi,...,Bp  Cl  ''  ...  ''  Cq.  Therefore,  tne  M»  procedure  is  sound 
for  this  case.  D 

We  do  not  yet  have  a proof  for  the  completeness  of  the  M* 
procedure,  but  know  no  counter-examples  to  it. 
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There  is  a hierarchy  of  procedures  which  can  be  applied  when 
using  models,  each  capable  of  proving  more  theorems  than  its 
predecessor  in  the  hierarchy:  the  unmodified  procedure  with 
identical  subgoal  elimination  rule,  the  « procedure  without  the 
identical  subgoal  elimination  rule,  and  the  M»  procedure  with 
totally  free  repetition  of  goals. 

The  incompatibility  of  the  higher  identical  subgoal 
elimination  rule  and  the  use  of  models  forces  a tradeoff  between 
the  use  of  models  and  the  single  most  useful  elimination  rule. 
Thus,  if  the  model  being  used  is  weak  or  non-existent,  it  may  be 
more  fruitful  to  eliminate  its  use  and  retain  the  higher 
identical  elimination  rule. 

However,  the  loss  of  the  higer  subgoal  elimination  rule  is  not 
total  since  weaker  forms  of  the  higher  identical  subgoal 
elimination  rule  may  be  substituted  while  retaining  the 
completeness  of  the  use  of  models. 

Consider  again  the  last  example  which  demonstrated  the 
incompatibility  of  the  unmodified  » procedure  with  the  use  of 
models . 


Example 

1.  -lA  V c 

2.  -.B  V C 

3.  A V B 

4.  -k:  V A 

5.  -k:  V B 

Possible  derivations  from  the  top  goal  C include; 


6.  C theorem 


7.1.  IC]  A 

7.2.  [C]  B 
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extend  6 by  1 
extend  6 by  2 


ti.1.1.  (C] 

(A)  -.u 

ex  tend 

7.1  by  3 

d.1.2.  (C] 

lA]  C 

ex  tend 

7.1  by  4 

8.2.1.  (C] 

13]  -A 

ex  tend 

7.2  by  3 

8.2.2.  IC] 

13]  C 

extend 

7.2  by  5 

9.1.1.  (C] 

(A]  l^B] 

■>C 

ex  tend 

8.1.1  by  2 

9. 1.2.1.  [C]  lA]  IC] 

A 

extend 

8.1.2  by  1 

9. 1.2. 2.  [C]  (A]  IC] 

B 

ex  tend 

8.1.2  by  2 

9.2.1.  (Cj 

IB]  (-.A] 

•^C 

extend 

8.2.1  by  1 

9. 2. 2.1.  IC]  (BJ  IC] 

A 

ex  tend 

8.2.2  by  1 

9. 2. 2. 2.  IC]  [B]  Id 

B 

extend 

8.2.2  by  2 

10.1.1.  <f) 

reduce 

9.1.1 

10.1.2.1.1. 

[C]  lA] 

[C] 

[A] 

-iB 

extend 

9. 1. 2. 1 by 

3 

10.1. 2. 1. 2. 

[Cl  [Al 

[Cl 

[A] 

C 

ex  tend 

9 . 1.  2. 1 by 

4 

10.1.  2.  2.1. 

[C]  [A] 

IC] 

IB] 

tA 

ex  tend 

9.1.2.?  by 

3 

10.1.  2.  2.  2. 

[C]  [A] 

[Cl 

IB] 

C 

extend 

9. 1.2. 2 by 

4 

10.2.1.  0 

reduce 

9.2. 1 

10.2.  2.1.1. 

[C]  [3] 

[C] 

[A] 

ex  tend 

9. 2. 2.1  by 

3 

10.  2.  2.  1.  2. 

[C]  [B] 

[C] 

[A] 

C 

extend 

9. 2. 2. 1 by 

4 

10.2.  2.  2.1. 

[C]  [B] 

IC] 

[3] 

-tA 

extend 

9. 2. 2. 2 by 

3 

10.2.  2.  2.  2. 

IC]  IB] 

[C] 

[B] 

C 

extend 

9. 2. 2. 2 by 

5 

Consider 

chains  9 

.1. 

2.  1 

([C] 

[A]  [C]  A)  and 

9. 2. 2. 2 ([C 

:]  [3] 

[C ] a).  Both  these  chains  can  be  eliminated  with  no  loss  of 
completeness.  The  justification  for  this  is  that  every  operation 
or  sequence  of  operations  employed  to  remove  the  terminal  literal 
A of  chain  i#. 1.2.1  or  the  terminal  literal  B of  chain  9. 2. 2. 2 can 
equally  well  be  employed  to  remove  the  earlier  A or  B.  The 
critical  observation  is  that  in  each  of  these  chains  the  new 
terminal  literal  has  the  same  set  of  higher  non-identical 
subgoals  as  an  earlier  occurrence  of  that  same  literal,  i.e.,  A 
has  only  higher  non- identical  subgoal  C in  both  occurrences  in 
chain  9. 1.2.1  and  B has  only  higher  non- identical  subgoal  C in 
both  occurrences  in  chain  9. 2.  2.  2. 

The  retention  of  needed  chains  such  as  b.1.2  ([C]  [A]  C)  and 

• .1.2  MCI  (3)  C)  .which  also  have  higher  identical  goals  is 
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justified  since  the  new  later  occurrences  of  C appears  with  a 
higher  goal  A (in  8.1.2)  or  B (in  8.2.2)  which  was  not  a higher 
goal  of  the  earlier  occurrences  of  C. 

The  rule  can  be  expressed  asi:  any  deduction  in  which  a goal 
has  the  same  set  of  higher  non- identical  (to  itself)  goals  as  a 
higher  occurrence  of  the  same  goal  can  be  eliminated. 
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3.  .Unification  Algorithms  for  Artificial  .Intelligence  Languages 


3.1.  .Introduction 

An  extremely  important  component  of  many  artificial 
intelligence  languages  is  a general  pattern  matcher  for  symbolic 
expressions.  .In  present  artificial  intelligence  languages, 
pattern  matching  is  typically  uni-directional,  i.e.,  the 
expressions  are  not  treated  symmetrically  permitting  variable 
assignments  in  both  expressions.  (Indeed,  we  will  often  use  tne 
term  pattern  matching  to  refer  to  the  case  of  unification  in 
which  at  least  one  of  the  two  expressions  has  no  variables.)  This 
is  a major  source  of  incompleteness  since  it  is  clear  that  some 
form  of  unification  (which  is  bi-directional)  is  required  for 
completeness.  .In  addition  to  the  incompleteness  aspect  of  this 
uni-directional  pattern  matching,  even  where  completeness  and 
full  capability  for  unification  is  not  required,  it  is  often 
desirable  to  have  at  least  a limited  bi-directional  pattern 
matching  capability.  For  example,  in  OLI&P  it  .would  be 
desirable  if  when  (GOAL  (P  »X  **Y)  ...)  is  attempted,  functions 
with  pattern  (P  0 **Z)  would  be  evoked  as  well  as  patterns  with 
greater  or  equal  generality  than  the  evoking  pattern  (P  ♦X  ♦♦Y). 

The  solution  which  we  argue  for  here  is  based  on  two 
components:  semi-unification  and  variable  splitting  (or,  in  more 

general  terms,  semi-unification  and  elaboration). 


171 


Semi-unification  algorithms  are  presented  <for  important  data 
typea:  vectors,  bags,  and  -classes.  These  algorithms  return  the 
most  immediate  and  natural  (in  general  incomplete)  set  of 
unifiers  for  two  expressions.  Semi-unification  can  be  used 
without  variable  splitting  -for  languages  and  applications  where 
completeness  is  not  required  but  some  bi-directional  pattern 
matching  capability  is.  Where  completeness  is  required,  the 
additional  process  of  variable  splitting  or  elaboration  can  be 
incl uded . 

In  order  to  motivate  this  solution,  let  us  consider  the 
alternatives. 

.The  -first  possibility  is,  of  course,  to  merely -write  complete 
unification  algorithms  -for  the  desired  data  types  and  be  done 
with  it.  .There  are  theoretical  difficulties  in  this  approach. 
One  of  the  data  types  treated  here,  the  vector,  is  simply  a 
sequence  of  elements  in  order  with  possible  repetition  of  the 
elements.  The  use  of  fragment  variables,  which  in  pattern 
matching  permit  the  matching  of  the  variable  to  an  arbitrary 
number  of  elements  in  the  opposing  expression,  results  in  the 
possible  generation  of  an  infinite  number  of  unifiers  for  vector 
unification.  A typical  example  of  this  phenomenon  is  the 
unification  of  expressions  (VECTOR  A ♦•►X)  and  (VECTOR  ♦♦X  A) 
where  A is  a constant  and  X is  a -fragment  variable.  Since  X can 
match  an  arbitrary  number  of  elements,  the  two  expressions  have 
unifiers  with  X assigned  the  empty  vector,  the  vector  consisting 
of  one  A,  the  vector  consisting  of  two  A's,  etc,  with  unifying 
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expressions  (VECTOR  A),  (VECTOR  A A),  (VECTOR  AAA),  etc., 
respectively. 

This  problem  does  not  occur  for  the  other  two  data  types 
considered  here.  Me  present  complete  unification  algorithms  >for 
bags  and  classes.  .It  may  also  be  possible  to  write  a complete 
unification  algorithm  for  vectors  which  adopts  a more  general 
representation  of  unifiers  which  .finitely  expresses  an  infinite 
set  of  unifiers.  For  example,  the  entire  set  of  unifiers  of  the 
two  expressions  above  could  be  expressed  as  X*  (VECTOR  A*')  for  all 
n^O . 

Even  where  this  approach  is  possible  (as  for  bags  and 
classes),  it  is  not  always  appropriate.  The  striving  for  logical 
completeness  in  pattern  matching  obviously  iias  costs,  and 
language  designers  and  users  are  not  always  willing  to  pay  those 
costs.  Except  in  those  cases  where  completeness  is  required  in 
essentially  all  usage  of  the  system,  an  approach  wnich  permits 
completeness  while  also  allowing  for  more  restrictive  incomplete 
pattern  matching  is  appropriate. 

A second  alternative  is  the  deferral  of  completion  of  pattern 
matching  until  the  skeleton  of  a solution  is  .found.  The  solution 
is  correct  provided  the  deferred  matches  succeed.  This  is  the 
approach  of  Huet's  constrained  resolution  [58].  .It  has  the 
advantage  of  not  r.eauir.ing  .complete  unification  at  each  step  in  a 
deduction,  but  allowing  .for  the  demonstration  of  the  existence  of 
a set  of  unifiers  at  the  end  of  a trial  deduction.  Tnis 
suostitution  of  determining  the  existence  of  unifiers  for  sets  of 
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expressions  for  the  computing  of  complete  sets  of  unifiers  (i.e., 
demonstrating  existence  of  unifiers  rather  than  computing  most 
general  unifiers)  can  result  in  a substantial  saving  of 
computational  effort  as  well  as  bypassing  theoretical  issues 
concerning  computation  of  most  general  unifiers.  This  approach 
has  the  disadvantage  in  the  design  of  Artificial  .Intelligence 
Languages  of  employing  not  fuLly  instantiated  goals  and  subgoals 
(not  fully  instantiated  because  the  pattern  matching  is  being 
deferred) . This  lack  of  instantiation  will  inhibit  the  full 
power  of  the  heuristics  programmed  in  the  language  to  select 
among  goals  to  be  attempted.  With  respect  to  the  programmed 
heuristics,  it  is  more  advantageous  to  have  -fully  instantiated 
goals  which  can  be  evaluated  -for  rejection  or  interest  than  to 
save  some  computation  by  leaving  goals  uninstantiated. 

A third  alternative  is  the  use  of  a parameterized  incomplete 
pattern  matching  or  unification  algorithm.  Such  an  algorithm 
would,  -for  example,  return  unifiers  of  increasing  complexity  for 
increasing  values  of  a numerical  parameter.  A problem  solver 
using  such  an  algorithm  -could,  upon  -failure,  automatically 
backtrack  to  a previous  pattern  matching  or  unification 
operation,  increase  the  parameter  value,  and  continue  with  an 
expanded  set  of  unifiers.  The  principal  disadvantage  of  this 
approach  is  the  lack  of  control  of  the  system  user  over  the 
unifiers  being  returned.  The  mechanism  for  generating  additional 
unifiers  is  hidden  from  him. 

The  fourth  approach,  the  one  being  adopted  here,  provides  the 
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user  with  this  type  of  control.  The  operations  which,  when 
applied  to  the  expressions  being  unified,  result  in  the 
generation  of  additional  unifiers  are  explicitly  available  to 
him.  The  generality  of  this  approach  as  compared  to  the  third 
approach  is  that  this  approach  could  be  used  to  implement  the 
second  approach. 

3.2.  iTerrainology 

Expressions  are  written  in  iQver se_guote__mode.  In  contrast  to 
LISP  in  which  expressions  are  evaluated  unless  explicitly  quoted, 
I in  inverse  quote  mode  expressions  are  evaluated  only  if  preceded 

by  a prefix  character  or  characters  indicating  a variable  to  be 
evaluated  or  assigned  a value.  The  -function  of  elements  of 
expressions  (whether  the  element  is  a constant  or  variable)  is 
denoted  by  the  absence  or  presence  and  type  of  a prefix  as 
opposed  to  alphabetical  conventions  as  in  formal  logic. 

PsIiSiiiSD  iii*  An  expression  is  defined  to  be 

(1)  a constant, 

(2)  a simple  variable,  or 

(3)  a -list  consisting  of  an  expression  (the  head  of  the 
expression)  followed  by  an  arbitrary  number  of  elements  (the  tail 
of  the  expression)  . 

Expressions  which  are  constants  or  simple  variables  are  termed 
atomic  expressions;  other  expressions  are  termed  nqn-atomic 
expr  essions . 
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Definition  3^2.  An  eleinent  (of  an  expression)  is  defined  to 
be 

(1)  an  expression,  or  ' 

(2)  a 'fragment  variable. 

An  element  will  be  represented  by  lower  case  Latin  letters  a, 
b,  etc.  A sequence  of  elements  will  be  represented  by  lower  case 
Greek  letters  g,  etc. 

ls.1’  A 3round_exBression  is  an  expression  with  no 

variables. 

P?liDi£i22  lii*  The  type  of  each  non-atomic  expression  is 
determined  by  its  head.  The  expression  type  designates  the 
storage  and  pattern  matching  or  unification  discipline  to  be  used 
for  the  expression.  Only  expressions  with  the  same  type  can  be 
successfully  matched.  Three  types  are  presently  defined: 
vectors,  bags,  and  -classes. 

1^5.  A vector  is  an  ordered  sequence  of  elements 
permitting  repetition  of  elements.  .It  can  represent  the  argument 
list  of  an  associative  or  non-associative  function  with  a fixed 
or  variable  number  of  arguments. 

PSliDi5i2D  i*.6.  A Baa  is  an  unordered  sequence  of  elements 
permitting  repetition  of  elements.  .It  can  represent  the  argument 
list  of  an  associative  or  non-associative  commutative  -function 
with  a fixed  or  variable  number  of  arguments. 

P£ii2i£i2D  lil*  A class  is  an  unordered  sequence  of  elements 
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not  permitting  repetition  of  elements.  .It  is  a mathematical  set 
and  can  represent  the  argument  list  of  an  associative  or 
non-associative  function  with  set  argument. 

Expression  heads  VECTOR,  BAG,  and  CLASS  have  type  vector,  bag, 
and  class  respectively. 

Examples  of  functions  with  vector  arguments  are  the  LISP 
functions  "list”  and  "append".  .The  ‘Mist"  function  takes  an 
arbitrary  number  of  arguments,  but  is  not  associative,  i..e., 
.list(x;.list(y;zl  ]?<listMist[x;y]  ;2l  . The  v" append"  'unction  takes 
an  arbitrary  number  of  arguments  and  is  associative. 

Examples  of  functions  with  bag  arguments  are  the  arithmetic 
functions  "plus"  and  "mean"..  .The  "plus"  .function  takes  an 
arbitrary  number  of  arguments  and  is  associative.  The  "mean" 
■function  takes  an  arbitrary  number  of  arguments  and  is  not 
associative. 

Examples  of  -functions  with  class  arguments  are  the  logical 
functions  "and"  and  "equivalence".  Both  are  associative. 

A constant  is  defined  to  be 

(1)  a numerical  constant,  e.g. , 3,  -2,  4.5,  or 

(2)  a symbolic  constant,  e.g..,  ;12A,  x,  ABC,  but  not  .12,  *X.  A 
symbolic  .constant  is  represented  as  a string  of  characters  not 
interpretable  as  a numerical  constant  or  a variable. 

Definition  3^9.  A variable  is  represented  as  a symbolic 
constant  preceded  by  a prefix.  ^Prefixes  are  •,  *♦,  *♦*,  $,  $$, 
and  $$$. 
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P®liDi^i2D  A ^ variable  whose  prefix 

is  one  of  * and  $. 

PSfiDitiSG  ^ f ® variable  whose 

prefix  is  one  of  $$,  and  $$$. 

The  different  variable  prefixes  have  different  interpretations 
in  the  context  of  a pattern  matching  or  unification  operation. 

The  three  prefixes  using  «■  denote  assignment  of  values  to 
variables  regardless  of  past  variable  values.  The  three  prefixes 
using  $ require  past  variable  values  to  be  matched  in  the 
matching  expression.  .If  the  variable  has  not  previously  been 
assigned  a value,  $ prefixes  behave  like  their  * counterparts, 
i.e.,  $X  acts  as  ♦X,  $$X  acts  as  ♦♦X,  and  $$$X  acts  as  ♦♦♦X. 

Z Three  kinds  of  ♦ prefixes  are  used; 

(1)  * represents  the  matching  of  the  variable  to  a single 
expression , 

(2)  ♦♦  represents  the  matching  of  the  variable  to  zero  or  more 
el ements , 

(3)  represents  the  matching  of  the  variable  to  one  or  more 
el ementa. 

.In  a pattern  matching  or  unification  operation,  the  value 

assigned  to  the  variable  is  an  expression  computed  from  the 

matching  expression.  .In  the  case  of  simple  variables  (with 
prefix  * ot  $),  the  value  assigned  is  simply  the  matched 
expression.  .In  the  case  of  fragment  variables  (with  prefix  **, 

, or  $$$)  , the  value  assigned  is  an  expression  whose  head 

is  the  same  as  the  head  of  the  expression  whose  elements  are 
being  matched  and  whose  tail  is  the  sequence  of  matched  elements. 
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^ BE§£ii§§«  Three  kinds  of  $ prefixes  are  used; 

(1)  in  any  context,  the  evaluation  of  $X  results  in  the  value  of 

X, 

(2)  in  the  context  of  an  expression  whose  type  is  the  same  as  the 
type  of  the  value  of  X,  the  evaluation  of  $$X  results  in  the 
insertion  of  the  elements  of  the  value  of  X into  the  expression; 
in  other  contexts,  the  evaluation  of  $§X  is  illegal, 

(3)  in  the  context  of  an  expression  whose  head  is  the  same  as  the 
head  of  the  value  of  X,  the  evaluation  of  $S$X  results  in  the 
insertion  of  the  elements  of  the  value  of  X into  the  expression; 
in  other  contexts,  the  evaluation  of  $$$X  results  in  the  value  of 
X if  it  has  other  than  one  element,  the  single  element  of  its 
value  if  it  has  exactly  one  element. 
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Example  3^1.  Assume  F and  G are  expression  heads  of  type 
vector  and  H is  an  expression  head  of  type  bag.  The  first  column 
gives  values  of  variable  X;  the  remaining  columns  give  the  result 
of  evaluating  (substituting  into)  various  expressions. 


X 

§$X 

$$SX 

A 

A 

illegal^ 

illegal^ 

(F) 

(F) 

(VECTOR)  2 

(F)2,3 

(F  B) 

(F 

B) 

(VECTOR  B)2 

b2 

(F  C 0) 

(F 

C 

D) 

(VECTOR  C D)  2 

(F  C D) 2 

X 

Al. 

I 

$X  JI 

IF  I $5X  JI 

IF  I $|$X  JI 

A 

(F 

1 

A J) 

illegal^ 

illegal]^ 

(F) 

(F 

I 

(F)  J) 

(F  I J) 

(P  I J)3 

(F  B) 

(F 

I 

(F  B)  J) 

(F  I B J) 

(F  I B J) 

(F  C D) 

(F 

I 

(F  C 0)  J) 

(F  I C D J) 

(F  I C D J) 

X 

..19. 

I 

$X  J) 

(G  I $$X  J) 

IG  I $$$X  JI 

A 

(G 

I 

A J) 

illegal^ 

illegally 

(F) 

(G 

I 

(F)  J) 

(G  I J) 

(G  I (F)  J)3 

(F  B) 

(G 

I 

(F  B)  J) 

(G  I B J) 

(G  I B J) 

(F  C D) 

(G 

I 

(F  C D)  J) 

(G  I C D J) 

(G  I (PC  D) 

J) 

X 

k-. 

IS 

I 

SX  J) 

(H  I $$X  J) 

(H  I $$$X  J) 

A 

(H 

I 

A J) 

illegal^ 

illegal^ 

(F) 

(H 

I 

(F)  J) 

illegal^ 

(H  I (F)  J)3 

(F  B) 

(H 

I 

(F  B)  J) 

illegal j 

(H  I B J) 

(F  C D) 

(H 

I 

(F  C D)  J) 

illegal^ 

(HI  (F  C D) 

J) 

must  have  a non-atoraic  expression  as  its  value  for  $$X 
or$$$X  to  be  successfully  evaluated. 


^This  represents  an  extended  definition  of  expressions.  The 
original  definition  stated  that  $$X  and  $$$X  are  elements  but  not 
expressions . 

^This  case  should  rarely  if  ever  occur.  It  is  intended  that 
♦♦♦  prefix  variables  be  matched  with  non-empty  expression 
sequences . 

^The  type  of  the  expression  containing  the  $$X  must  agree  with 
the  type  of  the  value  of  X. 
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Note  that  these  rules  -for  variable  prefixes  differ  slightly 
■from  those  of  the  latest  version  of  OLISP  (44,53],  .In  the  latest 
OLISP,  the  value  of  a .fragment  variable  will  be  "stripped”  when 
instantiated  in  some  other  expression,  L.e. , its  outer 
parentheses  and  type  will  be  removed,  regardless  of  whether  the 
type  of  the  -fragment  variable  and  the  type  of  the  expression 
agree.  For  example,  if  X has  the  value  (VECTOR  C D)  , (BAG  I $5X 
J)  would  have  the  value  (BAG  I C D J) . This  is  a pragmatically 
reasonable  interpretation  of  55X,  but  is  less  reasonable 
logically. 

5®fiQiti2Q  A substi tution_component  is  an  ordered  pair 

of  a variable  v and  an  expression  e written  as  v»e.  A 
substitution  component  denotes  the  assignment  of  the  expression 
to  the  variable  or  the  replacement  of  the  variable  by  the 
expression . 

P®£iDi;£i2Q  l&Il.  a substitution  is  a set  of  substitution 
components  with  distinct  first  elements,  i.e.,  distinct  variables 
being  substituted  for.  Applying  a substitution  to  an  expression 
results  in  the  replacement  of  those  variables  of  the  expression 
included  among  the  first  elements  of  the  substitution  components 
by  the  corresponding  expressions.  The  precise  form  of 
replacement  depends  on  the  prefix  of  the  variable,  i.e.,  whether 
it  is  a simple  or  -fragment  variable,  as  discussed  above.  The 
substitution  components  are  applied  to  the  expression  in  parallel 
and  no  variable  occurring  in  the  second  element  of  a substitution 
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component  will  be  replaced  even  it  it  occurs  as  the  first  element 
in  another  substitution  component.  Substitutions  will  be 
represented  by  lower  case  Greek  letters  a,  e.  The  application  of 
substitution  e to  expression  x is  denoted  by  xe  . The  composition 
of  substitutions  Oo  denotes  the  substitution  whose  effect  is  the 
same  as  first  applying  substitution  e,  then  applying  substitution 
a,  i.e.,  x(0a)  = (x0)o  .for  every  expression  x. 

^ unifier  of  two 

expressions  is  a substitution  which  when  applied  to  the  two 
expressions  results  in  equivalent  expressions.  Two  vectors  are 
equivalent  if  they  are  identical;  two  bags  are  equivalent  if  they 
contain  the  same  elements  in  the  same  number;  two  classes  are 
equivalent  if  they  contain  the  same  elements.  (In  an  actual 
implementation,  bags  and  .classes  would  ordinarily  be  retained 
only  in  a canonical  form  so  that  equivalent  bags  and  classes 
would  have  identical  canonical  forms.  Bags  and  classes  can  be 
put  into  a canonical  form  by  applying  a sorting  function  on  the 
elements  to  order  them  in  a standard  way  and  eliminating 
duplicate  elements  of  classes.  .In  the  following,  we  will  often 
overlook  the  distinction  between  identical  and  merely  equivalent 
expressions. ) 

SSliDiiiSD  One  expression  is  an  instance  of  another  if 

there  exists  a substitution  whose  application  to  the  second 
expression  results  in  the  .first. 

^ne  expression  is  a 3 round_instance  of 
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another  if  it  is  a ground  expression  and  is  an  instance  of  the 
other  expression. 

PSfiDi^i2Q  0*16  expression  is  a generalization  of 

another  if  the  second  is  an  instance  of  the  first. 

Similarly,  one  substitution  is  an  instance  (generalization)  of 
another  if  for  every  expression  the  result  of  applying  the  first 
substitution  to  the  expression  is  an  instance  (generalization)  of 
the  result  of  applying  the  second  substitution  to  the  expression. 


3.3.  .Semi-unification  and  variable  splitting 

One  way  of  providing  completeness  in  the  pattern  matching 
process  in  artificial  intelligence  languages  is  to  separate  the 
pattern  matching  process  into  two  components. 

One  of  these  components  is  the  current  pattern  matching 
process  or  some  limited  extension  of  it.  The  intent  here  is  to 
provide  a pattern  matching  procedure  which  returns  a subset  of 
the  unifiers  of  two  expressions.  This  subset  of  the  unifiers 
contains  the  most  natural  and  necessary  unifiers — those  that  even 
language  designers  not  concerned  with  completeness  should  expect 
to  be  returned  by  their  pattern  matcher.  This  first  component 
would  return  this  subset  of  the  unifiers  with  little  more 
computational  effort  than  conventional  pattern  matchers  thereby 
eliminating  objections  that  providing  completeness  is  too  costly. 

The  second  component  is  a process  which  can  be  performed  on  a 
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pair  of  expressions  to  cause  the  first  component  to  compute 
additional  unifiers.  Every  unifier  should  be  (an  instance  of)  a 
unifier  generated  by  the  first  component  after  some  number  of 
applications  of  the  second  component  to  the  exprcssionc. 

Tnere  are  numerous  possible  ways  of  designing  these  two 
components.  A large  class  of  these  are  described  in  the  section 
“Semi-unification  and  elaboration". 

Our  choice  here  is  to  let  the  first  component  oe  a slight 
extension  of  the  ordinary  pattern  matching  process  to  permit 
variables  of  each  expression  to  be  assigned  values  in  the  pattern 
matching  process.  .It  has  already  been  argued  that  this  sort  of 
extension  is  useful  even  in  the  absence  of  total  completeness, 
thereby  justifying  its  inclusion  in  the  first  (incomplete) 
component  ratner  than  the  (completeness  providing)  second 
component.  This  component  is  called  semi-unification  since  it 
has  the  characteristics  of  unification  in  its  ability  to  assign 
values  to  variables  of  both  expressions  but  is  incomplete  and 
cannot  therefore  be  simply  called  unification. 

Semi-unification  as  presented  here  is  compl<»te  in  the  case 
where  no  fragment  variables  are  present.  .Its  incompleteness  in 
the  case  where  ’fragment  variables  are  present  results  ’from  its 
failure  to  consider  one  ’fragment  variable  including  in  its  value 
part  of  another.  Our  second  component  facilitates  such 
assignments  being  made  in  semi-unification  by  replacing  fragment 
variables  by  two  new  fragment  variables  (splitting  the  variable). 
This  process  is  called  yar iable_splittina . 
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For  example,  consider  the  matching  of  the  expressions  (VECTOR 
♦♦X  A)  and  (VECTOR  B ♦♦Y) . Semi-unification  is  just  essentially 
a bi-directional  pattern  matching  process  which  returns  a single 
unifier.:  {X*(VECTOR  B)  , Y^(VECTOR  A)}.  This  is  incomplete 

because  such  unifiers  as  {X*(VECTOR  B C) , Y»(VECTOR  C A)}  are  not 
discovered.  The  failure  to  discover  (a  generalization  of)  the 
latter  unifier  is  a consequence  of  semi-unification  not  including 
parts  of  fragment  variables  in  the  value  of  another. 

A single  application  of  variable  splitting,  replacing  either 
♦♦X  or  ♦♦Y  by  two  new  fragment  variables,  results  in  the 
generation  of  the  most  general  unifier  of  the  two  expressions  by 
semi-unification  applied  to  the  expressions  after  variable 
splitting.  Splitting  **X,  (VECTOR  ♦-Xl  -♦X2  A)  and  (VECTOR  B 
♦-Y)  are  obtained.  Semi-unification  returns  unifiers  including 
{Xl*(VECTOR  B)  , Y-.-(VECTOR  **X2  A)}  or,  displaying  the  value  of  X, 
{X*  (VECTOR  B >— X2),  Y*  (VECTOR  X2  A)). 

We  will  present  in  detail  semi-unification  algorithms  for 
vectors,  bags,  and  classes,  prove  that  they  terminate  and  are 
sound  (return  no  false  unifiers)  , and  that  semi-unification  plus 
variable  splitting  is  complete. 

yje  will  assume  in  the  following  that  every  non-atomic 
expression  is  either  a vector,  bag,  or  class,  that  values  for 
bound  variables  have  already  been  substituted  in  the  expressions 
so  that  the  only  remaining  variables  are  unbound,  and  that  only 
fragment  variables  .with  the  *♦  prefixes  are  used.  Extension  to 
typed  expression  heads  other  than  VECTOR,  BAG,  or  CLASS  and 
prefixes  is  straightforward. 
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3.4.  Semi-unification 


The  procedures  unify  and  semi-unify  are  described  below  in  the 
one  procedure  (semi-) unify . The  string  “( semi- ) unify"  should  be 
uniformly  replaced  in  the  text  of  the  procedure  by  either  "unify" 
or  "semi-unify"  to  generate  the  two  procedures.  The  unify 
procedure  is  intended  to  be  the  central  routine  of  a complete 
procedure  for  unifying  expressions  (if  such  a thing  exists).  The 
semi-unify  procedure  is  intended  to  be  the  central  routine  of  an 
incomplete  procedure  -for  unifying  expressions  which,  in  concert 
with  variable  splitting,  yields  completeness. 

The  unify  and  semi-unify  procedures  themselves  handle  the 
cases  where  one  or  both  of  the  expressions  is  a constant  or 
variable;  other  cases  are  handled  by  calling  the  appropriate 
vector,  bag,  or  class  unification  or  semi-unification  procedure. 

The  unify  and  semi-unify  procedures,  and  each  of  the 
subordinate  procedures,  return  a set  of  all  the  unifying 
substitutions  found.  .If  the  expressions  are  not  unlfiable,  the 
empty  set  is  returned. 

Examples  of  vector,  bag,  and  class  semi-unification  appear  at 
the  end  of  the  descriptions  of  the  algorithms. 

1.  If  x»y  then  return  {{}). 

2.  if  x is  a simple  variable  of  the  .form  «-X  and 

a.  y is  a simple  variable  or  constant  then  return  {{X*-y}). 

b.  y is  a vector  of  the  -form  (VECTOR  B)  and 


1.  X is  not  contained  in  b then  return  {{X»y}). 

2.  X is  contained  in  s then  let  e®lX*  (VECTOR  ♦*?) } and 
return  (ea  I o e (semi-) unify ( (VECTOR  ♦*T) ; ye]}. 

c.  y is  a bag  of  the  'form  (BAG  B)  and 

1.  X is  not  contained  in  B then  return  {{X^y}}. 

2.  X is  contained  in  b then  let  o»{X»(BAG  ♦•»T)}  and  return 

(eo  I o € (serai-)  unify  ( (BAG  **T);  ye]}. 

d.  y is  a class  of  the  form  (CLASS  B)  and 

1.  X is  not  contained  in  b then  return  {{X*y}}. 

2.  X is  contained  in  b then  let  e={X*(CLASS  ♦♦Tl  ♦*T2)}  and 
return  {ea  I a « (serai-)  unify  [ (CLASS  **T1  ♦♦T2);  ye]}. 

3.  If  y is  a simple  variable  of  the  form  ♦Y  and 

a.  X is  a constant  then  return  {{Y^x}}. 

b.  X is  a vector  of  the  form  (VECTOR  a)  and 

1.  Y is  not  contained  in  a then  return  {{Y*x}}. 

2.  Y is  contained  in  a then  let  e*{Y*(VECTOR  ♦♦T)  } and 
return  {ea  I a c (serai-) unify (xe;  (VECTOR  *♦!)  ] } . 

c.  X is  a bag  of  the  form  (BAG  a)  and 

1.  Y is  not  contained  in  o then  return  {{Y*x}}. 

2.  Y is  contained  in  a then  let  e“{Y*-(BAG  ♦♦T)]  and  return 

{ea  I a € (serai-)  unify  { xe;  (BAG  ♦♦T)]}. 

d.  X is  a class  of  the  form  (CLASS  a)  and 

1.  Y is  not  contained  in  a then  return  {{Y*x)}. 

2.  Y is  contained  in  o then  let  e«{Y*(CLASS  ♦♦Tl  ♦♦T2) } and 
return  {eo  | a € (semi-) unify [ xe ; (CLASS  ♦♦Tl  ♦♦T2)]}. 

4.  If  X is  a constant  or  y is  a constant  then  return  {}. 

5.  If  X is  a vector  and 
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a.  y is  a vector  then  (semi-) unifyvector (x;  y] . 

bi.  y is  not  a vector  then  return  {). 

6.  If  X is  a bag  and 

a.  y is  a bag  then  ( semi-) uni fybag [ x;  y] . 

b.  y is  not  a bag  then  return  {}. 

7.  If  X is  a class  and 

a.  y is  a class  then  (semi-) unifyclass [x;  y] . 

b.  y is  not  a class  then  return  {}. 

Notes: 

1.  -If  the  two  expressions  being  unified  are  equal,  the  null 
substitution  is  returned  as  the  only  unifier  (step  1).. 

2.  .If  one  of  the  expressions  is  a simple  variable,  the 
substitution  of  the  other  expression  for  the  variable  is  returned 
as  the  only  unifier  provided  the  variable  is  not  contained  in  the 
other  expression  (steps  2a,  3a,  2bl,  2cl,  2dl,  3bl,  3cl,  3dl).. 

.If  both  expressions  are  variables,  the  second  will  be  substituted 
for  the  first.  .If  the  variable  is  contained  in  the  expression 
and  the  expression  is  a vector,  bag,  or  class,  a vector,  bag,  or 
class  consisting  of  new  fragment  variables  will  be  assigned  to 
the  variable  and  unification  will  proceed  (steps  2b2,  2c2,  2d2, 
3b2,  3c2,  3d2)..  This  permits  the  discovery  of  unifiers  for  such 
unifications  as  *X  with  (VECTOR  ♦♦X)  (X  is  an  arbitrary  vector) 
and  <-X  with  (CLASS  (X  is  an  arbitrary  class,  Y is  an 

arbitrary  subclass  of  X)i. 

3.  Non- identical  constants  are  not  unifiable  (step  4). 

4.  Only  vectors  can  be  unified  with  vectors,  only  bags  can  be 
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unified  with  bags,  and  only  vectors  can  be  unified  with  vectors 
(steps  5,  6,  7)  . 


3.5.  Vector  semi-unification 

.The  semi-unifyvector  procedure  returns  an  incomplete  list  of 
unifiers  for  two  vectors. 

The  semi-unifyvector  procedure,  when  presented  with  two 
non-empty  vectors,  always  removes  the  first  element  of  the  first 
vector  at  each  step. 

wnen  presented  with  two  vectors  whose  first  elements  are  not 
fragment  variables,  it  directly  unifies  the  first  elements  and 
then  unifies  the  instantiated  remainders  of  the  vectors.  When 
one  or  both  of  the  first  elements  of  the  vectors  are  fragment 
variables,  each  possible  substitution  of  an  initial  subvector 
(including  the  empty  vector)  from  the  opposing  vector  for  the 
fragment  variable  is  tried. 

1.  If  X is  of  the  form  (VECTOR)  and 

a.  y is  of  the  form  (VECTOR  —Y  8)  then  let  e« {Y- (VECTOR)  ) 
and  return  {ea  I a e semi-unify  ( x;  (VECTOR  B)e)}. 

b.  y is  not  of  the  form  (VECTOR  ♦♦Y  s)  then  return  {}. 

2.  If  y is  of  the  form  (VECTOR)  and 

a.  X is  of  the  form  (VECTOR  ♦♦X  a)  then  let  6*{X* (VECTOR) ) 
and  return  {oo  I a e semi-unify [ (VECTOR  a)e;  y]}. 

b.  X is  not  of  the  form  (VECTOR  ♦♦X  a)  then  return  {}. 


189 


3.  If  X is  of  the  form  (VECTOR  a a)  and  y is  of  the  .form 
(VECTOR  a 0)  for  arbitrary  element  a then  semi-unify( (VECTOR 
a);  (VECTOR  B)]. 

4.  If  X is  of  the  form  (VECTOR  •— X o)  then 

a.  if  y is  of  the  form  (VECTOR  ♦*Y  B)  then  for  each  partition 
of  X into  (VECTOR  a y)  for  which  Y is  not  contained  in  o 
let  a={Y*(V£CTOR  a))  and  return  {8o  | o e 

semi-unify( (VECTOR  y)9;  (VECTOR  B)0j}  and 

b.  for  each  partition  of  y into  (VECTOR  B 6)  for  which  X is 
not  contained  in  B let  e= {X* (VECTOR  B)}  and  return  {Bo  | a 
€ semi-unify I (VECTOR  a) 9;  (VECTOR  «)0]}. 

5.  If  X is  of  the  .form  (VECTOR  a o)  where  element  a is  not  a 
•fragment  variable  and 

a.  y is  of  the  -form  (VECTOR  ♦*Y  B)  then  for  each  partition  of 
X into  (VECTOR  o y)  for  which  Y is  not  contained  in  o let 
e* {Y* (VECTOR  a))  and  return  {9o  | o € semi- unify! (VECTOR 
y)9;  (VECTOR  8)01). 

b.  y is  of  the  form  (VECTOR  b B)  where  element  b is  not  a 
•fragment  variable  then  return  l0o  1 9 e semi-unify  I a ; b]  a 
o € semi-unify! (VECTOR  o)0;  (VECTOR  B)0]). 

Notes: 

1.  .If  one  of  the  vectors  is  the  empty  vector  and  the  other  is 
not,  the  latter  must  be  composed  solely  of  fragment  variables 
which  must  be  bound  to  the  empty  vector.  Otherwise,  there  is  no 
unifier  (steps  1,  2)4. 

2.  .If  the  .first  elements  of  two  vectors  are  identical,  the 
remainders  of  the  vectors  are  unified  (step  3). 
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3.  If  the  first  element  of  one  or  both  of  the  vectors  is  a 
fragment  variable,  each  substitution  of  any  initial  subvector  of 
the  opposing  vector  not  containing  the  variable  is  performed  and 
the  remainders  of  the  vectors  are  unified  for  each  substitution 
(steps  4a,  4b,  5a)  . 

4.  If  neither  of  the  first  elements  of  the  vectors  is  a 
fragment  variable,  the  two  -first  elements  are  unified  against 
each  other  and  the  remainders  of  the  vectors  are  unified  for  each 
unifier  of  the  -first  elements  (step  5b)  . 

3.6.  Bag  semi-unification 

The  semi-unif ybag  procedure  returns  an  incomplete  list  of 
unifiers  -for  two  bags. 

The  semi-unif  ybag  procedure  first  eliminates  by  pairs  any 
element  appearing  in  both  bags. 

The  semi-unif  ybag  procedure,  when  presented  with  two  non-empty 
bags  with  no  common  elements,  always  removes  the  first  element  of 
the  first  bag. 

When  presented  with  a -first  bag  whose  first  element  is  not  a 
fragment  variable,  each  match  between  the  element  and  each 
element  which  is  not  a -fragment  variable  in  the  second  bag  is 
attempted.  -If  the  match  is  successful,  the  remainders  of  each  of 
the  bags  are  unified.  When  the  first  element  of  the  first  bag  is 
a fragment  variable,  each  possible  substitution  of  a subbag 
(collection  of  elements  from  a bag)  from  the  second  bag  is  tried. 
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In  either  case,  each  possible  substitution  for  any  fragment 
variable  in  the  second  bag  of  any  subbag  of  the  first  bag  wnich 
contains  the  -first  element  is  also  tried. 

1.  If  X is  of  the  form  (BAG)  and 

a.  y is  of  the  -form  (BAG  B)  then  let  e®{Y*(BAG)}  and 

return  {6o  | o c semi-unif  y ( x;  (BAG  s)0]}. 

b.  y is  not  of  the  -form  (BAG  *♦¥  B)  then  return  {}. 

2.  If  y is  of  the  -form  (BAG)  and 

a.  X is  of  the  form  (BAG  ♦♦X  a)  then  let  e={X*(BAG)}  and 
return  (eo  | o e semi-unify ( (BAG  o)8;  yl  } . 

b.  X is  not  of  the  form  (BAG  ♦♦X  a)  then  return  {}. 

3.  If  X is  of  the  form  (BAG  o a y)  and  y is  of  the  form  (BAG  B 

a 5)  for  arbitrary  element  a then  semi-unif y(  (BAG  o y);  (BAG 

B «)]. 

4.  If  X is  of  the  form  (BAG  ♦♦X  a)  then 

a.  for  each  fragment  variable  **Y  in  y (y  possibly  reordered 
is  of  the  form  (BAG  **Y  B))  and  for  each  partition  of  x 
into  (BAG  a y)  for  which  Y is  not  contained  in  a which 
does  include  first  element  ♦♦X  let  0*{Y*(BAG  a)}  and 
return  {Bo  | a e semi-unify ( (BAG  y)0;  (BAG  B)ei}  and 

b.  for  each  partition  of  y into  (BAG  S 6)  for  which  X is  not 
contained  in  B let  0*{X*(BAG  B))  and  return  {Bo  I o « 
semi-unif y I (BAG  a)B;  (BAG  <)B]}. 

5.  If  X is  of  the  -form  (BAG  a o)  where  element  a is  not  a 
fragment  variable  then 
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a.  for  each  fragirent  variable  **Y  in  y (y  possibly  reordered 

is  of  the  form  (BAG  e))  and  for  each  partition  of  x 

into  (BAG  a y)  for  which  Y is  not  contained  in  o which 
does  include  first  element  a let  0 = {Y-(BAG  o))  and  return 
{do  I o c semi- unif y [ (BAG  y)9;  (BAG  6)6]}  and 

b.  for  each  element  b in  y which  is  not  a fragment  variable 
(y  possibly  reordered  is  of  the  form  (BAG  b 6))  return  {00 
I 6 c semi-unify [ a;  b]  a 0 c semi-unify [ (BAG  a)6;  (BAG 
6)6]}. 

Notes: 

1.  If  one  of  the  bags  is  the  empty  bag  and  the  other  is  not, 
the  latter  must  be  composed  solely  of  fragment  variables  which 
must  be  bound  to  the  empty  bag,.  Otherwise,  there  is  no  unifier 
(steps  1,  2)  . 

2.  .If  two  elements  of  two  bags  are  identical,  the  remainders 
of  the  bags  are  unified  (step  3). 

3.  .If  the  first  element  of  x is  a fragment  variable  or  any 
element  of  y is  a fragment  variable,  each  substitution  of  any 
subbag  of  the  opposing  bag  not  containing  the  variable  and 
substituting  for  or  including  in  the  substitution  the  first 
element  of  x is  performed  and  the  remainders  of  the  bags  unified 
for  each  substitution  (steps  4a,  4b,  5a). 

4.  If  the  'first  element  of  x is  not  a fragment  variable,  it  is 
unified  with  each  element  of  y which  is  not  a fragment  variable 

and  the  remainders  of  the  bags  are  unified  for  each  unifier  of  I 

the  two  elements  (step  5b)  . 1 
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3.7.  Class  semi- unification 

The  semi-unifyclass  procedure  returns  an  incomplete  list  of 
unifiers  for  two  classes. 

Since  class  unification  requires  an  element  of  one  class 
possibly  to  be  matched  against  more  than  one  element  of  another 
class,  matched  elements  cannot  be  simply  removed  and  disregarded 
as  in  the  vector  and  bag  semi-unification  algorithms.  'Previously 
matched  elements  must  be  retained.  They  may  be  kept  in  an 
additional  variable  in  the  algorithm  or,  for  simplicity  of 
description  and  implementation,  be  retained  in  the  original 
classes  themselves  as  in  the  algorithm  being  presented  here.  The 
following  conventions  are  followed  by  the  algorithm:  previously 
matched  elements  of  two  classes  are  retained  in  the  right  end 
(tail)  of  the  two  classes  and  the  leftmost  of  any  two  duplicate 
elements  in  a single  class  is  immediately  eliminated.  Two 
classes  are  fully  unified  when  they  are  identical  (all  the 
elements  are  previously  matched  elements  in  the  common  tail  of 
the  two  classes)  . 

The  semi-unifyclass  procedure,  when  presented  with  two 
non-empty  classes  with  no  previously  unmatched  common  elements  or 
duplicate  elements  .within  a single  class,  always  matches  the 
first  element  of  the  first  class  at  each  step  after  a possible 
interchange  of  the  two  classes.  Tne  interchange  is  a 
normalization  operation  whose  result  is  that  the  first  element  of 
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the  first  class  after  the  interchange  is  guaranteed  to  be 
previously  unmatched. 

Class  semi-unification  proceeds  almost  exactly  as  does  bag 
semi-unification  except  for  the  retention  of  matched  expressions 
at  the  right  end  of  the  classes  and  (thus)  the  ability  to  match  a 
previously  unmatched  element  against  elements  previously  matched. 

1.  If  X is  of  the  form  (CLASS  o ay  a c)  for  arbitrary  element 
a then  semi-unify[ (CLASS  o y a c)f  yl- 

2.  If  y is  of  the  -form  (CLASS  6 a 6 a c)  for  arbitrary  element 
a then  semi-unif y [ x;  (CLASS  6 6 a c)]. 

3.  If  X is  of  the  form  (CLASS  o a y)  and  y is  of  the  form 
(CLASS  B a 5)  for  arbitrary  element  a and  y^6  then 
semi-unifyl (CLASS  o y a) ; (CLASS  s « a)]. 

4.  If  X is  of  the  form  (CLASS  a)  and  y is  of  the  -form  (CLASS  3 
a)  then  semi-unify [y;  x]  . 

5.  If  X is  of  the  form  (CLASS  a)  then 

a.  for  each  fragment  variable  ♦♦Y  in  y (y  possibly  reordered 
is  of  the  form  (CLASS  ♦♦Y  B))  and  for  each  partition  of  x 
into  (CLASS  a y)  for  which  Y is  not  contained  in  a wnich 
does  include  first  element  ♦♦X  let  e»lY»(CLASS  a)}  and 
return  {eo  | o e semi-unify ( (CLASS  y a)e;  (CLASS  B a)8]} 
and 

b.  for  each  partition  of  y into  (CLASS  B 6)  for  which  X is 
not  contained  in  B let  e>{X<- (CLASS  B)}  and  return  {ea  I o 
e semi-unif y ( (CLASS  a B)e;  (CLASS  6 B)6]}. 
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6.  If  X is  of  the  form  (CLASS  a a)  v/here  element  a is  not  a 


fragment  variable  then 

a.  for  each  .fragment  variable  ♦♦Y  in  y (y  possibly  reordered 

is  of  the  form  (CLASS  ♦♦Y  B))  and  for  each  partition  of  x 
into  (CLASS  a Y)  for  which  Y is  not  contained  in  a which 
does  include  first  element  a let  e*lY*(CLASS  o)}  and 
return  | a g semi-unify ( (CLASS  Y «)8;  (CLASS  6 o)e]} 

and 

b.  for  each  element  b in  y which  is  not  a fragment  variable 
(y  possibly  reordered  is  of  the  form  (CLASS  b B))  return 

' {00  I 0 e semi-unify ( a ; b]  a a e semi-unify ( (CLASS  a a)0; 

(CLASS  B a) 0]  } . 

Notes.: 

1.  If  a class  has  a repeated  element,  the  .first  occurrences  of 
that  element  are  deleted  (steps  1,  2)., 

2.  If  two  elements  of  two  classes  not  in  the  common  tail  of 
the  classes  (not  previously  matched)  are  identical,  the  classes 
with  the  elements  moved  to  the  end  are  unified  (step  3). 

3.  To  insure  that  the  next  substitution  operation  use  a 
previously  unmatched  element,  the  classes  are  interchanged  if  the 
second  is  a tail  of  the  .first  (step  4)  . 

4.  If  the  .first  element  of  x is  a fragment  variable  or  any 
element  of  y is  a fragment  variable,  each  substitution  of  any 
subclass  of  the  opposing  class  not  containing  the  variable  and 
substituting  .for  or  including  in  the  substitution  the  first 
element  of  x is  performed  and  the  remainder  of  the  classes  are 
unified  for  each  substitution  (steps  5a,  5b,  6a). 
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5.  If  the  first  element  of  x is  not  a fragment  variable,  it  is 
unified  with  each  element  of  y which  is  not  a fragment  variable 
and  the  remainders  of  the  classes  are  unified  for  each  unifier  of 
the  two  elements  (step  6b). 

3.8.  Examples  of  semi-unif ication 

The  following  are  actual  traces  of  the  semi-unify, 
semi-unifyvector , semi-unif  ybag  , and  semi-unifyclass  procedures 
as  applied  to  the  problem  of  unifying  two  expressions  whose 
elements  are  respectively  (1)  A,  ♦X,  ♦♦Y,  and  and  (2)  A,  B, 

B,  and  C.  The  semi-unification  of  vectors,  bags,  and  classes, 
each  consisting  of  this  same  collection  of  elements,  illustrates 
the  differences  in  the  semi-unification  algorithms  employed  for 
the  different  data  types  and  the  differing  complexity  of  the 
problems  as  evidenced  by  the  number  of  returned  unifiers. 

These  examples  do  not  illustrate  all  the  features  or  the  full 
complexity  of  the  semi-unification  algorithms  since  the  second 
expression  is  variable  'free.  Thus,  the  results  are  an  instance 
of  pattern  matching  as  well  as  unification.  The  lacic  of 
variaoles  (particularly  the  lack  of  fragment  variables)  in  the 
second  expression  guarantees  that  the  set  of  unifiers  returned  is 
a complete  set,  i.e.,  no  other  unifiers  are  possible. 

The  traced  algorithms  differ  slightly  from  the  preceding 
definitions  affecting  primarily  the  order  in  which  operations  are 
done . 


I 
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CNTER  UNIPYX; 

! X = (VECTOR  A «.X  — Y ♦♦Z) 

! Y » (VECTOR  A B B C) 

I ENTER  SEUIUl'lIFYVECTOR: 

II  X - (A  *X  ♦♦Y  ,-Z) 

1 I Y = (A  B B C) 

! ! ENTER  SEMIUl'lIFYVECTOR: 

! ! ! X = (♦X  **Y  ♦♦Z) 

! ! ( Y = (B  B C) 

! ! ! ENTER  UNIFYX: 

! ! ! 1 X = ♦X 

! ! 1 ! Y = B 

1 ! ! UNIFYX  = (( (X  . B) ) ) 

! ! ! ENTER  SEMIUN IFYVECTOR; 

! ! ! ! X * (♦♦Y  *^Z) 

! ! ! ! Y » (B  C) 

! ! ! 1 ENTER  SEi-lIUIilFYVECTOR: 

1 ! ! ! I X * (*»Z) 

! ! ! ! ! Y » (B  C) 

! ! ! ! ! ENTER  SEHIUNIF YVECTOR; 

1 ! ! ! 1 1 X = NIL 

!!!!!!  Y = (B  C) 

! ! 1 ! ! SEMIUl'lIFYVECTOR  = NIL 

1 ! ! ! ! ENTER  SEMIUN.IF YVECTOR: 

! I 1 1 ! ! X = NIL 

!!!!!!  Y = (C) 

! 1 1 1 ! SEMIUN.IF  YVECTOR  = NIL 

I I I I I ENTER  SEW-IUMIF YVECTOR: 

!!!!!!  X » NIL 

1 ! 1 ! 1 1 Y » NIL 

! ! ! I I SEMIUN.IFYVECTOR  = ( ( ( Z VECTOR  B C)  (Y  VECTOR)  (X  . B)  ) ) 

! ! ! ! SEMIUMIF YVECTOR  = ( ( (.Z  VECTOR  B C)  (Y  VECTOR)  (X  . B)  ) ) 

1111  ENTER  SEHIUN.IFYVECTOR: 

! i ! ! ! X » (*-^Z) 

! ! ! ! ! Y = (C) 

! ! 1 ! 1 ENTER  SEMIUNIF YVECTOR: 

! 1 ! ! I 1 X - NIL 

!!!!!!  Y - (C) 

! 1 ! I 1 .SEMIUMIF YVECTOR  » NIL 

! ! ! 1 1 ENTER  SEMIUNIF YVECTOR: 

!!!!!!  X » NIL 

1 ! ! ! ! I Y « NIL 

1 ! ! ! 1 SEMIUMIF YVECTOR  « (((Z  VECTOR  C)  (Y  VECTOR  B)  (X  . B)  ) ) 

! 1 ! ! SEMIUMIFYVECTOR  » ( ( (Z  VECTOR  C)  (Y  VECTOR  B)  (X  . B) ) ) 

! ! ! ! ENTER  SEMIUMIFYVECTOR: 

! ! 1 ! ! X « (**Z) 

! ! ! 1 ! Y - NIL 

! ! ! 1 ! ENTER  SEi-lIUMIF YVECTOR: 

! ! ! 1 ! 1 X « NIL 

! ! 1 1 1 ! Y « NIL 

111!!  SEMIUMIFYVECTOR  « (((Z  VECTOR)  (Y  VECTOR  3 C)  (X  . B)  ) ) 
! ! 1 I SEMIUMIFYVECTOR  * ( ( (.Z  VECTOR)  (Y  VECTOR  B C)  (X  . B)  ) ) 

! ! ! SEMI UN IF YVECTOR  « (((Z  VECTOR  B C)  (Y  VECTOR)  (X  . B)  ) 

((Z  VECTOR  C)  (Y  VECTOR  B)  (X  . B) ) 
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(CZ  VECTOR)  (Y  VECTOR  B C)  (X  . B)  ) ) 
! ! SEMIUfilFYVECTOR  = (((Z  VECTOR  B C)  (Y  VECTOR)  (X  . B)  ) 

((Z  VECTOR  C)  (Y  VECTOR  B)  (X  . B) ) 

((Z  VECTOR)  (Y  VECTOR  B C)  (X  . B) ) ) 

! SEMIUNIFYVECTOR  » (((Z  VECTOR  B C)  (Y  VECTOR)  (X  . B) ) 

((Z  VECTOR  C)  (Y  VECTOR  B)  (X  . B)  ) 

((Z  VECTOR)  (Y  VECTOR  B C)  (X  . B) ) ) 

UNIFYX  * (((Z  VECTOR  B C)  (Y  VECTOR)  (X  . B) ) 

{(Z  VECTOR  C)  (Y  VECTOR  B)  (X  . B)  ) 

((Z  VECTOR)  (Y  VECTOR  B C)  (X  . B) ) ) 

Example  3^3. 

ENTER  UNIFYX: 

! X » (BAG  A *X  — Y **Z) 

! Y > (BAG  A B B C) 

! ENTER  SEHIUN.IFYBAG: 

! 1 X « (A  *X  — Y **Z) 

! ! Y » (A  B B C) 

I I ENTER  UN.IFYX: 

! ! ! X » *X 

III  Y * B 

! ! UNIFYX  = ( ( (X  . B)  ) ) 

! ! ENTER  SEMIUNIF YBAG: 

! ! ! X * (**Y  **Z) 

! ! 1 Y » (B  C) 

! ! ! ENTER  SEMIUN IFYBAG; 

! ! ! ! X * (**Z) 

111!  Y « NIL 

111!  ENTER  SEMIUNIFYBAG: 

•111!  X » NIL 

! 1 1 ! ! Y « NIL 

111!  SEMIUNIFYBAG  » (((Z  BAG)  (Y  BAG  B C)  (X  . B) ) ) 

1 I ! SEMIUNIFYBAG  » (((Z  BAG)  (Y  BAG  B C)  (X  . B)  ) ) 

1 1 ! ENTER  SEMIUNIFYBAG; 

111!  X * (*-Z) 

Jill  Y * (C) 

111!  ENTER  SEMIUNIFYBAG: 

1 1 1 1 1 X * NIL 

1 1 1 1 1 Y * NIL 

1 1 I I SEMIUNIFYBAG  » (((Z  BAG  C)  (Y  BAG  B)  (X  . B) ) ) 

111!  ENTER  SEMIUN, IFYBAG; 

1111!  X « NIL 

111!!  Y - (C) 

111!  SEMIUNIFYBAG  » NIL 

1 1 1 SEMIUNIFYBAG  - (((Z  BAG  C)  (Y  BAG  B)  (X  . B) ) ) 

1 1 1 ENTER  SEMIUNIFYBAG: 

111!  X = (•*Z) 

111!  Y - (B) 

111!  ENTER  SEMIUNIFYBAG: 

111!!  X » NIL 

1111!  Y » NIL 

111!  SEMIUNIFYBAG  « (((Z  BAG  B)  (Y  BAG  C)  (X  . B) ) ) 
111!  ENTER  SEi-lIUNIFYBAG; 
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1 ! i ! I X » NIL 

! J 1 1 I Y * (6) 

I ! ! I SEMIUNilFYBAG  * NIL 

1 ! 1 SErtlUMIFYSAG  » ( ( (Z  BAG  B)  (Y  BAG  C)  (X  . B) ) ) 

! • ! ENTER  SEMIUN IFYBAG: 

! ! ! ! X » (♦♦Z) 

I ! J ! Y » (B  C) 

! 1 i 1 ENTER  SEMIUNIFYBAG; 

! ! J ! I X * NIL 

! ! J ! ! Y * NIL 

1 ! ! 1 SEMIUNIFYBAG  = (({Z  BAG  B C)  (Y  BAG)  (X  . B)  ) ) 

! ! ! ! ENTER  SEMIUNIFYBAG: 

! 1 1 ! 1 X » NIL 

I I I I I Y ■ (C) 

i 1 ! 1 SEMIUNIFYBAG  = NIL 

) ! I ! ENTER  SEMIUNIFYBAG: 

I 1 I I ! X * NIL 

1 ! I 1 ! Y » (B) 

! ! ! ! SEMIUNIFYBAG  = NIL 

1 J 1 ! ENTER  SEMIUNIFYBAG: 

1 ! ! ! ! X « NIL 

1 ! ! ! ! Y » (B  C) 

111!  SEMIUNIFYBAG  * NIL 

1 1 1 SEMIUNIFYBAG  * (((Z  BAG  B C)  (Y  BAG)  (X  . B)  ) ) 

1 1 SEMIUNIFYBAG  * (((Z  BAG)  (Y  BAG  6 C)  (X  . B) ) 

( (Z  BAG  C)  (Y  BAG  B)  (X  . B) ) 

( (Z  BAG  8)  (Y  BAG  C)  (X  . B)  ) 

( (Z  BAG  B C)  (Y  BAG)  (X  . B)  ) ) 

1 1 ENTER  UN.IFYX: 

111  X » ♦X 

11!  Y » C 

1 1 UNIFYX  » ( ( (X  . C)  ) ) 

1 1 ENTER  SEMIUNIFYBAG: 

1 1 1 X « (— Y ♦*Z) 

111  Y * (B  B) 

1 1 1 ENTER  SEMIUNIFYBAG: 

111!  X = (**Z) 

1111  Y » NIL 

1111  ENTER  SEMIUNIFYBAG: 

I I I I I X » NIL 

11111  Y » NIL 

111!  SEMIUNIFYBAG  « (((Z  BAG)  (Y  BAG  B B)  (X  . C) ) ) 
1 1 1 SEMIUNIFYBAG  - (((Z  BAG)  (Y  BAG  B B)  (X  . C))) 

1 I 1 ENTER  SEMIUNIFYBAG: 

111!  X » (♦♦Z) 

1111  Y - (B) 

111!  ENTER  SEMIUNIFYBAG: 

1 1 1 1 1 X - NIL 

1 1 1 I 1 Y « NIL 

111!  SEMIUNIFYBAG  « (((Z  BAG  B)  (Y  BAG  B)  (X  . C))) 
1111  ENTER  SEMIUNIFYBAG: 

1111!  X - NIL 

1 1 1 1 1 Y - (B) 

111!  SEMIUNIFYBAG  « NIL 
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! ! I SEHIUWIFYBAG  = ( ( (Z  BAG  B)  (Y  BAG  B)  (X  . C) ) ) 

1 I I ENTER  SEHIUMIFYSAG; 

! 1 ! 1 X = (**Z) 

1 ! ! ! Y » (B  B) 

! 1 ! ! ENTER  SEHIUN.IFYBAG; 

! ! 1 ! ! X « NIL 

! ! 1 ! ! Y = NIL 

! ! ! ! SErtlUNIFYBAG  * ( ( (Z  BAG  B B)  (Y  BAG)  <X  . C))) 

! ! ) 1 ENTER  SEMIUN IF YBAG; 

! 1 ! ! 1 X » NIL 

1 ! ! i ! Y » (B) 

1 ! ! ! SEWIUN.IFYDAG  » NIL 

! ! ! 1 ENTER  SEMIUNIFYBAG; 

! 1 ! ! ! X = NIL 

! ! 1 ! 1 Y « (B  B) 

I ! ! ! SEMIUNIFYBAG  » NIL 

! ! ! SEMIUNIFYBAG  » (((Z  BAG  B B)  (Y  BAG)  (X  . C))) 

! ! SEMIUNIFYBAG  » (((Z  BAG)  (Y  BAG  B B)  (X  . C) ) 

({Z  BAG  B)  (Y  BAG  B)  (X  . C)  ) 

( (Z  BAG  B B)  (Y  BAG)  (X  . C)  ) ) 

1 SEMIUNIFYBAG  » (((Z  BAG)  (Y  BAG  B C)  (X  . B) ) 

( (Z  BAG  C)  (Y  BAG  B)  (X  . B)  ) 

( (Z  BAG  B)  (Y  BAG  C)  (X  . B)  ) 

( (Z  BAG  B C)  (Y  BAG)  (X  . B)  ) 

((.Z  BAG)  (Y  BAG  B B)  (X  . C)) 

((.Z  BAG  B)  (Y  BAG  B)  (X  . C)  ) 

(CZ  BAG  B B)  (Y  BAG)  (X  . C)  ) ) 

UNIFYX  * {((Z  BAG)  (Y  BAG  B C)  (X  . B) ) 


( (Z 

BAG 

C) 

(Y 

BAG 

B) 

(X 

B)  ) 

( (Z 

BAG 

B) 

(Y 

BAG 

C) 

(X 

B)  ) 

( (Z 

BAG 

B 

C) 

(Y 

BAG) 

(X 

B)  ) 

( (Z 

BAG) 

(Y  BAG 

B 

B) 

(X 

C)) 

( (2 

BAG 

B) 

(Y 

BAG 

B) 

(X 

C)) 

((■z 

BAG 

B 

B) 

(Y 

BAG) 

(X 

C))) 

Examole  3^4. 

ENTER  UNIFYX: 

! X - (CLASS  A ♦X  —Y  —Z) 

! Y = (CLASS  ABC) 

! ENTER  SEMIUNIFYCLASS: 

! ! X * (A  *X  ♦♦Y  ♦♦2) 

i 1 Y * (A  B C) 

! ! ENTER  UNIFYX: 

111  X » *X 

! I ! Y « B 

i ! UNIFYX  - ( ( (X  . B)  ) ) 
t ! ENTER  SEM,IUN.IFYCLASS: 

1 ! ! X » (*^Y  A B) 

! ! ! Y » (C  A B) 

I ! ! ENTER  SEMIUNIFYCLASS: 

! ! ! ! X * (**Z  B A C B A) 

! ! ! 1 Y = (C  B A) 

1111  ENTER  SEMIUNIFYCLASS: 
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! X = (A  B C A B 

! Y = (A  B C) 

SEHIUwIFYCLASS  = (((Z 
ENTER  SEHIUNIFYCLASS: 
! X = (A  B C A B) 

! Y = (C  A 3) 

SEMIUNIFYCLASS  = (((Z 
ENTER  3EMIUW.IFYCLASS: 
1 X * (A  B C A C) 

! Y = (B  A C) 

SEMIUN.IFYCLASS  = ( ( ( Z 
ENTER  SEHIUN.IFYCLASS: 
! X = (A  B C A) 

! Y = (B  C A) 

SERIUNIFYCLASS  = (({Z 


C) 

CLASS  ABC)  (Y  CLASS  C B A)  (X  . 3)  ) ) 

CLASS  A B)  (Y  CLASS  C i3  A)  (X  . E|  ) ) 

CLASS  A C)  (Y  CLASS  C B A)  (X  . c)  ) ) 

CLASS  A)  (Y  CLASS  C d A)  (X  . B)  ) ) 


ENTER  SEMIUNIFYCLASS: 

! X = (A  B C B C) 

I Y = (A  B C) 

SEF-IUNIFYCLASS  = (((Z  CLASS  3 C)  (Y  CLASS  C d A)  (X  . E)  ) ) 
ENTER  SEMIUNIFYCLASS: 

! X = (A  B C 3) 

! Y = (A  C B) 

SEMIUNIFYCLASS  = {((Z  CLASS  B)  (Y  CLASS  C 6 A)  (X  . 3)  J ) 
ENTER  SEWIUNIFYCLASS: 

I X = (A  a c c) 

I Y = (A  a C) 

SEI-IIUNIFYCLASS  * (((Z  CLASS  C)  (Y  CLASS  C B A)  (X  . B)  ) ) 
ENTER  SE*-1IUNIFYCLASS: 

1 X = (A  a c) 

! Y = (A  B C) 


SEMIUNIFYCLASS 
SEMIUNIFYCLASS  = 


= (((Z  CLASS)  (Y  CLASS  C B A)  (X 
(((Z  CLASS  ABC)  (Y  CLASS  C B A) 


( (Z  CLASS 
((.Z  CLASS 
((Z  CLASS 
((Z  CLASS 
((Z  CLASS 
((Z  CLASS 
((Z  CLASS) 

ENTER  SEMIUNIFYCLASS; 


A B) 

A C) 

A)  (Y 
B C) 

B)  (Y 

C)  (Y 


(Y  CLASS 
(Y  CLASS 
CLASS  C 
(Y  CLASS 
CLASS  C 
CLASS  C 


(Y  CLASS  C B 


C 

c 

B 

C 

B 

B 

A) 


B A) 

B A) 
A)  (X 
B A) 
A)  (X 
A)  (X 
(X  . 


• B))) 

(X  . B)) 
. B)  ) 

. B)) 

B)  ) 

• B)) 

. B)  ) 

. B)) 

B)  ) ) 


(X 

(X 

(X 


X = (^-Z  B A C B) 

Y = (A  C B) 

ENTER  SEMIUNIFYCLASS; 

I X = (B  C A a C A) 

1 Y = (B  C A) 

SEMIUNIFYCLASS  = (((Z  CLASS  B C A)  (Y  CLASS  C B)  (X  . E)  ) ) 
ENTER  SEMIUNIFYCLASS: 


I X » (B  C A B C) 

! y = (A  B C) 

SEMIUNIFYCLASS  * (((Z  CLASS  3 C)  (Y  CLASS  C B)  (X  . B)  / ) 
ENTER  SEM.IUNIFYCLASS: 

! X = (B  C A B A) 

! Y * (C  B A) 

SEMIUNIFYCLASS  = (((Z  CLASS  B A)  (Y  CLASS  C B)  {X  . B)  , ) 
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! ! ENTER  SEMIUNIFYCLASS : 

! ! ! X * (B  C A B) 

1 ! ! 1 Y = (C  A B) 

! SEWIUNIFYCLASS  = ( { (Z 
! ENTER  SEl-lIUNIFYCLASS: 
! 1 X - (B  C A C A) 

! Y = (B  C A) 

SEMIUNIFYCLASS  « {((Z 
ENTER  SEMIUNIFYCLASS; 
! X « (B  C A C) 

! Y » (B  A C) 

SEMIUNIFYCLASS  » {((Z 
ENTER  SEillUNIFYCLASS: 
! X » (B  C A A) 

I Y « (B  C A) 

SEMIUNIFYCLASS  » (((Z 
ENTER  SEM,IUNIFYCLASS: 
! X » (B  C A) 

! Y * (B  C A) 

1 SEMIUNIFYCLASS 


CLASS  B)  (Y  CLASS  C B)  (X  . B)  ) ) 


CLASS  C A)  (Y  CLASS  C B)  (X  . B)  ) ) 


CLASS  C)  (Y  CLASS  C B)  (X  . B)  ) ) 


CLASS  A)  (Y  CLASS  C B)  (X  . B)  ) ) 


SEMIUNIFYCLASS 


* (( (Z  CLASS) 

( ( (Z  CLASS  B C 
((Z  CLASS  B C) 
CLASS  B A) 
CLASS  B)  (Y 
CLASS  C A) 
CLASS  C)  (Y 
CLASS  A)  (Y 
CLASS) 


( (Z 
( (Z 
((Z 
((Z 
((Z 
((Z 


(Y  CLASS  C B) 
A)  (Y  CLASS  C 
(Y  CLASS  C B) 
(Y  CLASS 
CLASS  C 
(Y  CLASS 
CLASS  C 
CLASS  C 


(Y  CLASS  C B) 


C B) 
B)  (X 
C B) 
3)  (X 
B)  (X 
(X  . 


(X 

B) 

(X 

(X 

(X 


. B)  ) ) 
(X  . B) 
. B)) 

. B)) 

B)  ) 

. B) 

. B)) 

. 3)) 

B)  ) ) 


) 


truncated  here  for  brevity;  total  of  132  unifiers  returned 


3.9.  Termination  and  soundness  of  semi-unification 

Three  properties  are  reouired  of  the  unification  procedures: 

iS£i2iQ2£i2D»  §2yo4DS§§»  £2EBi,S££QSI§*  termination 

property  assures  us  that  the  procedure  terminates  after  a finite 
number  of  steps  and  incidentally  also  assures  us  that  the  number 
of  unifiers  returned  is>finite.  Tne  soundness  property  assures 
us  that  all  the  substitutions  returned  really  do  unify  the  two 
input  expressions.  The  completeness  property  assures  us  a 
complete  set  of  unifiers  is  returned,  i.e.,  no  unifier  of  the  two 
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expressions  is  unrepresented  by  some  member  of  the  cetutnea 
unifiers.  The  semi-unification  procedures  are  incomplete  oy 
design;  we  will  later  prove  that  semi-unification  in  combination 
with  variable  splitting  is  complete. 

lii*  procedure  serai-unify  is  sound  and  always 

terminates . 

Proof.  Our  proof  relies  on  the  fact  that  each  principal  step 
of  the  semi-unification  procedures  either  reduces  the  number  of 
variables  occurring  in  the  input  expressions  (while  possibly 
increasing  the  total  number  of  elements)  or  reduces  the  total 
number  of  elements.  .In  the  case  of  class  semi-unification,  the 
procedures  reduce  the  number  of  previously  unmatched  variables  or 
total  number  of  previously  unmatched  elements.  Thus,  the  number 
of  variables  in  the  two  expressions  is  constantly  declining,  and 
the  total  number  of  (unmatched)  elements  in  the  expressions  is 
reduced  between  removals  of  variables. 

.Induction  will  be  performed  on  the  tuple  (nj^,n2)  where  nj^  is 
the  number  of  distinct  variables  occurring  in  the  expressions  and 
n2  is  the  total  number  of  elements  occurring  in  the  expressions 
excluding  those  already  matched  as  elements  of  classes. 

(mj^,m2)  is  defined  to  be  less  than  (nj^,n2)  (written 
(m^ ,m2 ) < ( n^f n2) ) if  and  only  if  m^  is  less  than  n^,  or  m2  is 
equal  to  n2  and  m2  is  less  than  n2.  Thus,  the  measure  for  one 
pair  of  expressions  is  less  than  that  for  another  if  the  first 
has  fewer  variables,  or  the  same  number  of  variables  but  fewer 
el ements . 
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(OfO)  . If  02=0,  then  neither  expression  has  any 
(unmatched)  elements.  Tnus,  each  roust  be  an  empty  vector,  an 
empty  bag,  or  a class  with  all  elements  matched.  If  they  are  not 
both  empty  vectors,  not  both  empty  bags,  or  not  both  classes  with 
all  elements  matched,  then  the  empty  set  of  substitutions  is 
returned  (semi-unify  steps  5b,  6b,  7b).  Thus,  for  this  case 
semi-unify  immediately  terminates  and  is  sound  since  no  proposed 
unifiers  are  returned.  If  they  are  both  empty  vectors,  both 
empty  bags,  or  both  classes  with  all  elements  matched  (i.e., 
identical  classes)  , the  null  substitution  is  returned  as  the  only 
unifier  (semi-unify  step  1).  Thus  in  this  case  also,  semi-unify 
immediately  terminates.  It  is  also  sound  since  the  null 
substitution  is  being  returned  only  if  the  two  expressions  are 
equal . 

Assume  semi-unify  always  terminates  and  is  sound  for  all  pairs 
of  expressions  with  measure  (n2,n2)  < (n»j^,m2)  . Prove  it  terminates 
and  is  sound  -for  ( n^^,  n2)  • (m^^  ,m2 ) . Assume  ( n^^ , n2)  = (m^^  ,m2 ) for  two 
expressions.  Each  of  the  input  expressions  may  be  a constant, 
simple  variable,  vector,  bag,  or  class. 

1.  If  both  expressions  are  constants,  either  the  empty  set  of 
substitutions  is  returned  (semi-unify  step  4)  or  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1). 
In  either  case,  semi-unify  immediately  terminates.  A 
substitution,  the  null  substitution,  is  returned  only  if  the 
constants  are  equal.  Therefore,  semi-unify  is  sound  for  this 
case . 
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2.  If  one  expression  is  a constant  and  the  other  is  a simple 
variable,  the  substitution  of  the  constant  for  the  variable  is 
returned  as  the  only  unifier  (semi-unify  steps  2a,  3a).  Thus, 


semi-unify  immediately  terminates  for  this  case  and  is  sound 
since  the  instantiation  of  the  variable  by  the  returned 
substitution  results  in  the  constant  of  the  other  expression. 

3.  If  both  expressions  are  simple  variables,  either  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1) 
or  the  substitution  of  the  second  variable  for  the  first  is 
returned  as  the  only  unifier  (semi-unify  step  2a).  In  either 
case,  semi-unify  immediately  terminates.  Semi-unify  is  sound  for 
this  case  since  it  only  returns  the  null  substitution  if  the  two 
variables  are  identical,  and  if  they  are  not  identical  the 
substitution  of  one  for  the  other  results  in  the  two  instantiated 
expressions  being  identical. 

4.  If  one  expression  is  a constant  and  the  other  is  a vector, 
bag,  or  class,  the  empty  set  of  substitutions  is  returned 
(semi-unify  step  4).  In  this  case,  semi-unify  immediately 
terminates  and  is  sound  since  no  unifiers  are  returned. 

5.  If  one  expression  is  a simple  variable  and  the  other  is  a 
vector,  bag,  or  class,  either  the  substitution  of  the  vector, 
bag,  or  class  for  the  variable  is  returned  as  the  only  unifier 
(semi-unify  steps  2bl,  2cl,  2dl,  3bl,  3cl,  3dl ) or  the  vector, 
bag,  or  class  is  unified  with  a vector,  bag,  or  class  with  new 
fragment  variable  elements  substituted  for  the  simple  variable 
argument  (semi-unify  steps  2b2 , 2c2,  2d2,  3b2,  3c2,  3d2).  In  the 
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first  case,  semi-unify  immediately  terminates  and  is  sound  since 
the  substitution  of  the  vector,  bag,  or  class  for  the  variable 
results  in  the  instantiated  expressions  being  equal.  The  second 
case  will  be  treated  further  in  cases  7,  b,  and  9.  No  extra 
difficulty  with  termination  occurs  here  since  the  replacement  of 
the  simple  variable  by  a vector,  bag,  or  class  can  only  occur 
once.  Any  unifier  of  the  expression  derived  from  the  simple 
variable  and  the  other  expression  is  also  a unifier  of  the  simple 
J variable  and  the  other  expression.  Therefore  the  recursive  call 

' is  sound. 

6.  If  both  expressions  are  vectors,  bags,  or  classes,  but  not 
both  of  the  same  type,  the  empty  set  of  substitutions  is  returned 
(semi-unify  steps  5b,  6b,  7b).  In  this  case,  semi-unify 
immediately  terminates  and  is  sound  since  no  substitutions  are 
returned . 

7.  If  both  expressions  are  vectors,  either  (a)  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1) , 
(b)  the  empty  set  of  substitutions  is  returned  ( semi-uni fyvector 
steps  lb,  2b),  (c)  two  identical  elements  are  removed  and  the 
remaining  subvectors  are  semi-unified  ( semi-unif yvector  step  3), 
(d)  substitutions  are  made  for  a fragment  variable  and  the 
instantiated  remaining  subvectors  are  semi-unified 
(semi-unifyvector  steps  la,  2a,  4a,  4b,  5a),  or  (e)  two 
expressions  are  semi-unified  and  the  instantiated  remaining 
subvectors  are  semi-unified  (semi-unifyvector  step  5b). 

7a.  In  subcase  a,  semi-unify  immediately  terminates  and  is 
sound  since  any  substitution  does  unify  identical  expressions. 

207 

!/ 


I 


7b.  In  subcase  b,  semi-uni fyvector  immediately  terminates  and 
is  sound  since  no  substitutions  are  returned. 

7c.  In  subcase  c,  semi-unif yvector  is  called  recursively  with 
expressions  with  measure  (xj^,X2)  with  x^^n^  and  X2_<n2-2.  Thus, 
since  ( , X2 ) < ( n^,  n2)  » by  the  inductive  assumption  this  recursive 

call  terminates  and  is  sound.  Any  unifier  of  two  vectors  is  also 
a unifier  of  the  same  two  vectors  preceded  by  identical  elements. 
Tnerefore,  semi-unif yvector  is  sound  for  this  subcase. 

7d.  In  subcase  d,  semi-unifyvector  is  called  recursively  with 
expressions  with  measure  (n^-l/X)  for  some  x.  Thus,  since 
(nj^-l.x)  < (nj^,n2)  for  all  x,  by  the  inductive  assumption  this 
recursive  call  terminates  and  is  sound  and  the  returned 
substitutions  unify  the  remainder  vectors.  The  composition  of 
the  substitution  of  the  initial  subvector  for  the  fragment 
variable  and  a returned  unifier  for  the  remaining  vectors  unifies 
the  original  vectors.  Therefore,  semi-unifyvector  is  sound  for 
this  subcase. 

7e.  In  subcase  e,  semi-unify  is  called  recursively  with 
expressions  with  measure  (Xj^,X2)  with  X2_<n2  and  X2<n2.  If  called 
at  all  (if  semi-unify(a;b]  succeeds),  semi-unifyvector  is  called 
recursively  with  expressions  with  measure  (Xj^,X2)  with  Xj^<n2  for 
some  X2  (since  if  the  first  elements  are  unifiable  but  not 
identical  at  least  one  variable  must  have  been  substituted  for). 
Thus,  by  the  inductive  assumption,  each  recursive  call  terminates 
and  is  sound.  The  composition  of  the  returned  substitutions 
unify  the  unified  remainder  vectors  preceded  by  the  first 
elements  and  semi-unifyvector  is  sound  for  this  subcase. 
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8.  If  both  expressions  are  bags,  either  (a)  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1) , 
(b)  the  empty  set  of  substitutions  is  returned  ( semi-unifybag 
steps  lb,  2b)  , (c)  two  identical  elements  are  removed  and  the 
remaining  subbags  are  semi-unified  (semi-unifybag  step  3),  (d) 
substitutions  are  made  for  a fragment  variable  and  the 
instantiated  remaining  subbags  are  semi-unified  (semi-unifybag 
steps  la,  2a,  4a,  4b,  5a),  or  (e)  two  expressions  are 
semi-unified  and  the  instantiated  remaining  subbags  are 
semi-unified  (semi-unifybag  step  5b) . 

8a.  In  subcase  a,  semi-unify  immediately  terminates  and  is 
sound  since  any  substitution  does  unify  identical  expressions. 

8b.  In  subcase  b,  semi-unifybag  immediately  terminates  and  is 
sound  since  no  substitutions  are  returned. 

8c.  In  subcase  c,  semi-unifybag  is  called  recursively  with 
expressions  with  measure  (Xj^,X2)  with  and  X2£n2-2.  Thus, 

since  ( Xj^  ,X2 ) < (n^^,  02)  f by  the  inductive  assumption  this  recursive 
call  terminates  and  is  sound.  Any  unifier  of  two  bags  is  also  a 
unifier  of  the  same  two  bags  with  identical  elements  added. 
Therefore,  semi-unifybag  is  sound  for  this  subcase. 

8d.  In  subcase  d,  semi-unifybag  is  called  recursively  with 
expressions  with  measure  (nj-l,x)  for  some  x.  Thus,  since 
(nj^-1  ,x)  < (n2»n2)  for  all  x,  by  the  inductive  assumption  this 
recursive  call  terminates  and  is  sound.  The  composition  of  the 
substitution  of  the  subbag  for  the  fragment  variable  and  a 
returned  unifier  for  the  remaining  bags  unifies  the  original 
bags.  Therefore,  semi-unifybag  is  sound  for  this  subcase. 
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8e.  In  subcase  e,  semi-unify  is  called  recursively  with 
expressions  with  measure  with  Xj^^n^  and  X2<n2.  J£  called 

at  all  (if  semi-unify  ( a ;b)  succeeds),  semi-unif  ybag  is  called 
recursively  with  expressions  with  measure  (Xj^,X2)  with  X2<n2  for 
some  X2  (since  if  the  matched  elements  are  unifiable  but  not 
identical  at  least  one  variable  must  have  been  substituted  for). 
Thus,  by  the  inductive  assumption,  each  recursive  call  terminates 
and  is  sound.  The  composition  of  the  returned  substitutions 
unify  the  unified  remainder  bags  with  matched  elements  added  and 
semi-unif ybag  is  sound  for  this  subcase. 

9.  If  both  expressions  are  classes,  either  (a)  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1) , 
(b)  one  of  two  identical  elements  in  a single  class  is  removed 
and  the  remaining  subclasses  semi-unified  (semi-unifyclass  steps 
1,  2),  (c)  two  identical  elements  are  matched  against  each  other 
and  the  remaining  subclasses  are  serai-unified  (semi-unifyclass 
step  3),  (d)  substitutions  are  made  for  a fragment  variable  and 
the  instantiated  remaining  subclasses  are  semi-unified 
(semi-unifyclass  steps  5a,  5b,  6a) , or  (e)  two  expressions  are 
semi-unified  and  the  instantiated  remaining  subclasses  are 
semi-unified  (semi-unifyclass  step  6b) . The  last  two 
possibilities  may  occur  after  a one  time  interchange  of  the  two 
classes  (semi-unifyclass  step  4). 

9a.  In  subcase  a,  semi-unify  immediately  terminates  and  is 
sound  since  any  substitution  does  unify  identical  expressions. 

9b.  In  subcase  b,  semi-unifyclass  is  called  recursively  with 


expressions  with  measure  (Xj^,x2)  with  and  X2<n2-1.  Thus, 

since  ( Xj^ , X2 ) < { nj^,n2)  » by  the  inductive  assumption  this  recursive 
call  terminates  and  is  sound.  Any  unifier  of  two  classes  is  also 
a unifier  of  the  same  two  classes  with  a duplicate  element  added 
to  one  of  the  classes.  Therefore,  semi-unif yclass  is  sound  for 
this  subcase. 

9c.  In  subcase  c,  semi-unifyclass  is  called  recursively  with 
expressions  with  measure  (x^,x2)  with  x^^n^  and  X2<n2-2.  Thus, 

since  (Xj^,x2)<  (ni,n2)  » by  the  inductive  assumption  this  recursive 
call  terminates  and  is  sound.  (Note.  In  this  subcase,  and  in 
subcases  d and  e,  semi-unifyclass  is  called  recursively  with  the 
same  expressions  as  its  input  except  for  being  reordered  and 
instantiated  to  reflect  the  matching  performed.  The  reordering 
plus  the  restriction  by  step  4 of  the  procedure  that  pairs  of 
elements  in  the  identical  (previously  matched)  tails  of  the  two 
classes  not  be  matched  against  each  other  assures  termination. 

In  computing  (nj^,n2)  for  these  expressions,  only  variables  and 
elements  not  contained  in  the  identical  tails  are  counted.)  Since 
semi-unifyclass  is  called  recursively  with  the  identical 
expressions  except  for  reordering  to  reflect  restrictions  on 
future  matching,  the  unifiers  returned  by  the  recursive  call 
unify  the  input  expressions  and  semi-unifyclass  is  sound  for  this 
subcase . 

9d.  In  subcase  d,  semi-unifyclass  is  called  recursively  with 
expressions  with  measure  (n^-i^x)  for  some  x.  Thus,  since 
(''l-l  ,x)  < (n2,n2)  inductive  assumption  this 
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recursive  call  terminates  and  is  sound.  Since  semi-unifyclass  is 
called  recursively  with  the  identical  expressions  to  its  input 
except  for  instantiation  by  the  substitution  of  tlie  subclass  for 
the  fragment  variable  and  reordering  to  reflect  restrictions  on 
future  matching,  the  composition  of  the  substitution  of  the 
subclass  for  the  'fragment  variable  and  the  unifiers  returned  by 
the  recursive  call  unify  the  input  expressions  and 
semi-unifyclass  is  sound  for  this  subcase. 

9e.  In  subcase  e,  semi-unify  is  called  recursively  with 
expressions  with  measure  (Xj^,X2)  with  and  X2<n2.  If  called 

at  all  (if  semi-unify  ( a;b]  succeeds),  semi-unifyclass  is  called 
recursively  with  expressions  with  measure  (X2,X2)  with  X2<nj^  for 
some  X2  (since  if  the  matched  elements  are  unifiable  but  not 
identical  at  least  one  variable  must  have  been  substituted  for). 
Thus,  by  the  inductive  assumption,  each  recursive  call  terminates 
and  is  sound.  Since  semi-unifyclass  is  called  recursively  with 
the  identical  expressions  to  its  input  except  for  instantiation 
by  the  unifier  of  the  two  matched  elements  and  reordering  to 
reflect  restrictions  on  future  matching,  the  composition  of  the 
unifiers  returned  by  the  recursive  calls  unify  the  input 
expressions  and  semi-unifyclass  is  sound  for  this  subcase. 

Thus,  subject  to  the  inductive  assumption  that  semi-unify 
terminates  and  is  sound  for  all  pairs  of  expressions  with  measure 
(nj^ , 02)  < (mj^  ,m2 ) , semi-unify  terminates  and  is  sound  for  pairs  of 
expressions  with  measure  ( n^^, n2)  * (m^^  ,m2 ) . 

Therefore,  since  semi-unify  terminates  and  is  sound  for 
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(”l»n2)*  (0 , 0)  , semi-unify  terminates  and  is  sound  >for  every  pair 
of  express. ^ns.  G 

3.10.  Variable  splitting 

Variable  splitting  is  the  process  by  which  semi-unification 
can  be  extended  to  a complete  unification  algorithm. 

Variable  splitting  consists  of  replacing  single  fragment 
variables  by  two  newly  introduced  fragment  variables.  This 
introduction  of  additional  fragment  variables  will,  in  general, 
result  in  the  generation  of  additional  unifiers  by  the 
semi-unification  algorithm. 

For  example,  consider  the  problem  of  unifying  (VECTOR  ♦♦X  A) 
and  (VECTOR  A •►■►X)  . These  expressions  have  an  infinite  number  of 
unifiers  with  X assigned  vectors  consisting  of  an  arbitrary 
number  of  A*s.  However:,  the  semi-unification  algorithm  returns 
only  the  unifiers  .with  X assigned  the  empty  vector  and  the  vector 
consisting  of  a single  A since  fragment  variables  are  constrained 
by  the  algorithm  to  match  an  integral  number  of  elements  in  the 
opposing  expression. 

After  substitution  of  (VECTOR  --Xl  **X2)  for  fragment  variable 
X in  the  two  expressions,  the  expressions  are  (VECTOR  ♦♦Xl  ♦♦X2 
A)  and  (VECTOR  A »«X1  -*X2) . The  semi-unification  algorithm 
returns  the  unifiers  {XI  * (VECTOR) ,X2» (VECTOR) } , 

{XI ♦ (VECTOR) ,X2* (VECTOR  A) } , {XI  * (VECTOR  A ) ,X2 * (VECTOR) } , and 
{Xl*(V£CTOR  A)  ,X2» (VECTOR  A))  for  these  expressions.  The  first 
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three  of  these  unifiers  duplicate  the  previous  result  of  uni-fying 
(VECTOR  ♦.hX  A)  and  (VECTOR  A — -X)  , but  the  fourth  is  new 
representing  the  assignment  of  (VECTOR  A A)  to  X. 

It  is  clear  that  additional  applications  of  variable  splitting 
would  successively  result  in  the  generation  of  the  infinite 
sequence  of  unifiers  of  (VECTOR  ♦♦X  A)  and  (VECTOR  A ♦♦X)  . For 
example,  in  the  above  expressions  after  variable  splitting,  if 
(VECTOR  4-«-X3  «-«-X4)  were  substituted  for  variable  XI  or  X2  the 
additional  unifier  {X» (VECTOR  AAA)}  would  be  returned  by  the 
semi-unification  algorithm. 

Some  constraints  can  be  added  to  the  semi-unification  and 
variable  splitting  process  to  prevent  the  generation  of  or  to 
detect  and  eliminate  unifiers  duplicating  those  generated  prior 
to  a variable  splitting  operation. 

First,  no  variable  introduced  by  variable  splitting  should  be 
bound  to  the  empty  vector,  bag,  or  class.  If  this  occurred,  the 
other  variable  introduced  by  variable  splitting  would  be  treated 
identically  to  the  variable  replaced  by  variable  splitting  in  the 
previous  semi-unification  and  the  unifiers  produced  would 
duplicate  unifiers  generated  before  variable  splitting. 

This  constraint  is  readily  enforced  by  use  of  a simple 
syntactic  device:  using  ♦♦♦  prefix  variables  rather  than  ♦♦ 
prefix  variables  in  variable  splitting.  In  the  example  above, 
variable  splitting  would  result  in  the  formation  of  (VECTOR  ♦♦♦Xl 
•►•♦X2  A)  and  (VECTOR  A ♦♦♦Xl  ♦♦♦X2)  with  only  the  unifier 
|X1* (VECTOR  A)  ,X2* (VECTOR  A)}  returned  by  the  semi-unification 
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alaorithm  -aliminating  the  generation  of  the  three  duplicate 
unifiers. 

Second,  pairs  of  variables  introduced  by  a variable  splitting 
operation  should  not  collectively  be  assigned  a fewer  or  equal 
number  of  elements  to  the  maximum  number  of  elements  assigned  by 
the  semi-unification  algorithm  to  the  split  variable  before 
variable  splitting.  Restated,  the  split  variable  should  be 
assigned  a greater  number  of  elements  in  the  split  case  than  in 
the  unsplit  case. 

For  example,  consider  the  (degenerate)  case  of  unifying 
(VECTOR  ♦♦Y)  and  (VECTOR  A 3)  resulting  in  the  unifiers 

{X-^  (VECTOR)  ,Y-- (VECTOR  A B)  ) , {X*  (VECTOR  A),  Y*  (VECTOR  3)},  and 
{X» (VECTOR  A B)  ,Y*  (VECTOR) } being  returned  by  the 
semi-unification  algorithm.  Applying  variable  splitting  to 
variable  X of  these  two  expressions,  the  expressions  (using  the 
prefix  variable  device  suggested  above)  (VECTOR  ♦♦♦Xl  ♦♦♦X2 
♦ ♦Y)  and  (VECTOR  A B)  are  formed  with  unifier  {Xl»  (VECTOR 
A) ,X2* (VECTOR  B) ,Y» (VECTOR) } returned  by  the  semi-unification 
algorithm.  But  this  unifier  duplicates  the  third  unifier  of  the 
expressions  before  variable  splitting,  violating  the  condition 
that  XI  and  X2  be  collectively  assigned  more  than  two  elements. 

The  use  of  ♦♦♦  prefix  variables  in  variable  splitting  is  a 
special  case  of  this  constraint,  requiring  that  the  introduced 
variables  be  matched  (if  at  all)  to  at  least  two  elements. 

Third,  pairs  of  variables  introduced  by  variable  splitting 
must  not  be  universally  matched  together  against  variables  in  the 
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opposing  expression.  Otherwise,  they  are  being  treated  as  a unit 
and  their  being  derived  from  splitting  a variable  results  in  the 
generation  of  no  additional  unifiers. 

For  example,  consider  the  case  of  (VECTOR  •►•►X  ♦♦Y)  and  (VECTOR 
--V)  resulting  in  the  unifiers  {X-^  (VECTOR)  ,Y-- (VECTOR 
**V) ) , {X* (VECTOR  -rU) ,Y* (VECTOR  •♦V)),  {X- (VECTOR  **0 
**V)  ,Y- (VECTOR)  } , {U«- (VECTOR)  ,V- (VECTOR  ♦♦X  —Y)},  and  {U-(VECTOR 
♦♦Y) ,V» (VECTOR) } being  returned  by  the  semi-unification 
algorithm.  Applying  variable  splitting  to  variable  X of  these 
two  expressions,  the  expressions  (VECTOR  ♦♦•Xl  ♦■►-►X2  ♦■►Y)  and 
(VECTOR  -*0  **V)  are  formed  with  unifiers  {XI- (VECTOR 
♦♦U) ,X2- (VECTOR  **V) ,Y- (VECTOR) } , (U - (VECTOR) , V- (VECTOR  ---Xl 
---X2  --Y)},  {U- (VECTOR  Xl ) ,V- (VECTOR  ---X2  --Y)  } , (U- (VECTOR 
---XI  ---X2) ,V- (VECTOR  --Y) } , and  {U- (VECTOR  ---Xl  — X2 
--Y) ,V- (VECTOR) } returned  by  the  serai-unification  algorithm.  The 
first  of  these  unifiers  is  eliminable  by  the  second  constraint 
discussed  above;  the  second,  fourth,  and  fifth  are  eliminable  by 
the  current  third  constraint  since  ---XI  and  ---X2  were  always 
matched  together;  only  the  third  is  a new  unifier. 

3.11.  .Semi-unification  and  elaboration 

.It  is  useful  to  generalize  the  notions  of  semi-unification  and 
variable  splitting  to  create  a theory  of  complete  unification  for 
complex  data  types.  Such  a theory  would  provide  an  approach  to 
unification  in  more  complex  domains  such  as  those  employing 
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general  pattern  matching  actors  as  in  PLANWER  (21].  .In  addition, 
it  will  aid  in  the  proof  of  completeness  for  semi-unification  and 
variable  splitting. 

Consider  the  case  of  pattern  matching  actors  in  PLANNER. 

These  are  to  be  distinguished  from  the  more  general  actors 

consider  in  later  work  by  Hewitt  (23).  A (pattern  matching) 

actor  is  a completely  general  function  which  matches  expressions. 

Constants  (which  match  themselves)  and  variables  (which  match  | 

their  value  or  are  assigned  a value)  are  exeunples  of  actors.  | 

Vectors,  bags,  and  classes  are  also  actors  which  are  restricted  I 

to  matching  vectors,  bags,  and  classes  respectively  and  have  I 

* 

differing  conventions  on  treatment  of  expressions  in  the  matched  | 

I 

expression,  i.e.,  whether  they  are  order  independent  and  whether  j 

1 

they  may  be  matched  to  more  than  expression.  | 

An  example  of  a more  complex  actor  is  the  following  from  [21]:  I 


L 


<define  palindrome 
<actor  [] 

; "palindrome  is  a actor  of  no  arguments" 

<either 

<empty> 

;"a  palindrome  is  either  empty  or" 

<declaration  [x] 

; "declare  a new  local  x" 

<list  __x  (palindrome)  .x> 

;"let  X be  the  first  element  of  the 
linear  structure. 

Also  X must  be  the  last  element 
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with  a palindrome 
in  between‘'>>>> 

The  palindrome  actor  matches  palindromes,  sequences  of 
expressions  which  read  the  same  way  backwards  as  forwards.  It  is 
composed  of  three  additional  actors;  "either"  which  matches  any 
expression  which  matches  one  of  its  arguments,  "empty"  which 
matches  the  empty  expression,  and  ’Mist"  which  matches  any 
expression  composed  of  its  arguments  in  sequence.  .It  also 
recursively  calls  itself  in  seeking  to  match  an  expression 
composed  of  a subexpression  followed  by  a palindrome  followed  by 
the  same  subexpression. 

•Pattern  matching  is  concerned  only  with  the  syntax  of  the 
matched  expression.  Pattern  matching  matches  expressions  without 
regard  to  the  semantics  of  the  matched  expression.  For  example, 
the  palindrome  actor  will  examine  expressions  to  determine  if 
they  read  the  same  backwards  as  forwards,  but  will  not  attempt  to 
find  instances  of  the  matched  expression  if  it  includes  actors 
such  as  variables  which  are  palindromes.  The  consideration  of 
the  semantics  of  the  matched  expression,  and  the  attempt  to  find 
instances  of  the  matched  expression  for  which  the  pattern  match 
would  be  successful,  is  the  purpose  of  unification  as  opposed  to 
pattern  matching. 

As  a practical  matter,  it  is  not  reasonable  to  define 
unification  algorithms  for  large  classes  of  actors.  The 
definition  of  such  procedures  for  constants,  simple  variables, 
fragment  variables,  vectors,  bags,  and  classes  presented  here  is 
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quite  complex  enough  without  tne  addition  of  more  actors. 

Addition  of  another  actor  to  the  set  of  actors  usod  to  construct 
expressions  for  wnich  unification  algorithms  have  been  devised 
requires  explicit  consideration  of  how  the  new  actor  is  to  be 
unified  against  each  of  the  previously  defined  actors  and  against 
itself. 

In  addition  to  the  practical  impossibility  of  defining 
unification  algorithms  for  large  classes  of  actors,  there  are 
theoretical  difficulties,  if  the  actors  are  be  sufficiently 
general,  it  is  undecidable  whether  two  expressions  composed  from 
the  set  of  actors  have  any  unifier  at  all.  This  can  be  seen  as  a 
consequence  of  the  undecidability  of  the  emptiness  of  the 
intersection  of  two  context  free  languages.  If  the  set  of  actors 
from  which  expressions  can  be  constructed  is  sufficiently 
powerful  to  generate  the  set  of  context  free  languages,  it  is  not 
possible  to  determine  in  general  whether  any  pair  of  such 
expressions  has  a unifier. 

The  solution  to  the  problem  of  unification  with  general  actors 
is,  rather  than  defining  special  unification  algorithms  for 
handling  each  of  the  actors  (which  in  general  will  require 
consideration  of  each  pair  of  actors,  one  appearing  in  each  of 
the  expressions  being  matched)  , to  establish  a convention 
concerning  the  -form  of  expressions  unifiable  by  a 
semi-unification  algorithm  with  a completely  general  expression 
and  generable  by  an  elaboration  algorithm  from  a completely 
general  expression. 
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For  example,  the  palindrome  actor  as  written  above  is  complete 
when  matched  against  expressions  composed  entirely  of  constants. 
Thus,  this  actor  could  serve  as  a semi-unification  procedure  for 
palindromes,  the  matched  expression  being  restricted  to  being 
composed  of  constants.  It  is  also  clear  tnat  a procedure  could 
oe  written  that  successively  produces  all  expressions  composed  of 
constants  which  are  palindromes.  Such  a procedure  would  be 
called  an  elaboration  procedure. 

In  generating  a complete  unification  procedure  for  a set  of 
actors,  a semi-unification  procedure  and  elaboration  procedure 
would  be  defined  for  each  actor,  with  the  form  of  the  matched 
expression  in  semi-unification  and  the  form  of  the  expression 
generated  by  elaboration  both  dictated  by  the  commonly  understood 
convention.  In  the  extreme  case,  as  suggested  by  the  palindrome 
example  above,  the  convention  could  be  that  the  expression  be 
composed  solely  of  constants. 

The  procedure  for  unifying  two  expressions  then  becomea;  use 
the  elaboration  procedures  on  one  of  the  expression  to  form  an 
instance  of  the  expression  in  the  expression  suolanguage  dictated 
by  the  convention,  and  match  this  newly  formed  expression  against 
the  other  expression  using  the  semi-unification  procedures.  In 
the  example  where  the  convention  dictates  that  expressions  be 
composed  entirely  of  constants,  the  procedure  becomea:  form  a 
ground  instance  of  one  of  the  expressions  and  match  it  to  the 
other  expression. 

This  procedure  for  unifying  general  expressions  is  complete 
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provided:  every  ground  unifier  of  a general  expression  and  an 
expression  formed  according  to  the  convention  is  an  instance  of  a 
unifier  returned  by  the  semi-unification  procedures,  and  every 
instance  of  a general  expression  is  an  instance  of  an  expression 
formed  according  to  the  convention  by  the  elaboration  procedures. 

Subject  to  the  above  two  conditions,  the  statement  of 
completeness  means  every  ground  unifier  of  two  general 
expressions  is  an  instance  of  a unifier  returned  by  the 
semi-unification  procedures  applied  to  one  of  the  expressions  and 
an  elaboration  of  the  other  expression. 

The  use  of  the  set  of  expressions  formed  entirely  of  constants 
as  the  expression  sublanguage  understood  by  both  the 
semi-unification  and  elaboration  procedures  is  often  more 
restrictive  than  necessary.  A larger  sublanguage  which  permits  a 
substantial  reduction  in  the  number  of  expressions  produced  by 
elaboration  is  the  language  composed  of  constants  and  simple 
variables . 

For  example,  consider  again  the  palindrome  actor.  Assume  it 
is  intended  to  match  a vector  of  elements.  Then  elaborations  of 
the  palindrome  actor  would  be  (using  our  notation  for  simple 
variables,  etc.)  (VECTOR),  (VECTOR  *Xl  *X1),  (VECTOR  -Xl  *X2  ♦X2 
»Xl),  ....  The  semi-unification  procedure  for  the  palindrome 
actor  would  be  required  to  match  expressions  consisting  of 

constants  and  simple  variables,  assigning  values  to  the  simple  i 

variables  as  required  to  generate  a unifier.  | 

Constants  and  simple  variables  in  this  case  are  called 


primitive  actors,  understood  both  by  semi-unification  and 
elaboration  procedures.  Fragment  variables  are  non-primitive 
actors.  Elaborations  of  a fragment  variable  would  be  the  empty 
sequence,  a single  simple  variable,  two  simple  variables,  etc. 
Elaborations  of  the  expression  (VECTOR  ♦♦X  ♦♦Y)  would  be 
(VECTOR),  (VECTOR  -Xl),  (VECTOR  -Yl),  (VECTOR  *X1  -Yl) , (VECTOR 

•XI  ♦X2) , (VECTOR  •Yl  •Y2) , (VECTOR  -Xl  •X2  •Yl),  

In  this  case,  unification  of  (VECTOR  A ♦♦X)  and  (VECTOR  **Y  3] 
could  result  in  (elaborating  the  second  expression)  unifying 
(VECTOR  A --X)  and  (VECTOR  B)  with  no  unifiers,  (VECTOR  A ♦♦X) 
and  (VECTOR  ♦Yl  3)  with  unifier  (instantiating  the  parent 
fragment  variable  Y)  {X-(VECTOR  B),Y*(VECTOR  A)},  (VECTOR  A *-X) 
and  (VECTOR  ♦Yl  ♦YZ  B)  with  unifier  {X^(VECTOR  '-YZ  a),Y*(V£CTOR  A 
*Y2)},  (VECTOR  A ♦•^X)  and  (VECTOR  *-Yl  -YZ  B)  with  unifier 

{X- (VECTOR  -YZ  -Y3  B)  ,Y-- (VECTOR  A *Y2  ♦Y3)),  etc. 


3.12.  Completeness  of  semi-unification  and  variable  splitting 

We  will  prove  here  that  semi-unification  is  complete  for  two 
important  cases;  the  case  where  one  of  the  expressions  is 
variable  free  and  the  case  where  both  expressions  are  fragment 
variable  free.  The  completeness  of  semi-unification  for  these 
two  cases  provides  the  basis  for  two  uses  of  semi-unification  and 
variable  splitting  which  yield  completeness  in  the  general  case. 

The  first  use  of  semi-unification  and  variable  splitting 
requires  that  variables  in  one  of  the  expressions  be  instantiated 
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by  expressions  of  the  form  (veCTOK  ■►♦X)  , (DAG  *-*X)  , and  (CLASS 
♦*X)  and  variable  splitting  be  applied  to  the  result  of  such 
instantiations.  The  other  expression  is  left  unaltered. 

The  second  use  of  semi-unification  and  variable  splitting 
requires  that  variable  splitting  be  applied  to  both  expressions, 
but  eliminates  the  need  for  instantiating  either  of  them  as  in 
the  first  case. 

For  example,  consider  the  unification  of  (CLASS  ♦X)  and  (CLASS 
(VECTOR  A "•♦Y)  (VECTOR  **Z  B)  ) . Applying  the  second  method  and 
performing  a single  variable  splitting  operation  on  one  of  the 
vectors  in  the  second  argument,  we  obtain  (CLASS  *X)  and  (CLASS 
(VECTOR  A -*Y1  ♦♦Y2)  (VECTOR  **Z  B) ) with  most  general  unifier 
{X*  (VECTOR  A ♦♦Yl  B)  , Y*  (VECTOR  ♦*Y1  B)  , Z*  (VECTOR  A ♦♦Yl)}.  .In 
this  example,  we  were  successful  in  applying  the  method  and  using 
variable  splitting  on  only  one  of  the  expressions;  in  general, 
this  will  not  work  and  variable  splitting  must  be  applied  to  both 
expressions.  An  example  for  which  variable  splitting  must  be 
applied  to  both  methods  is  (VECTOR  (CLASS  -X)  (CLASS  (VECTOR  A 
♦♦V)  (VECTOR  B) ) ) and  (VECTOR  (CLASS  (VECTOR  A ♦-Y)  (VECTOR 

♦ ♦Z  B)  ) (CLASS  ♦U)),  i.e.,  an  extension  of  the  earlier  problem 
where  now  fragment  variables  which  must  be  split  appear  in  both 
expressions . 

Application  of  the  first  method  and  applying  variable 
splitting  to  the  second  expression  works  just  as  for  the  second 
method.  However,  if  in  using  the  first  method,  it  is  desired 
that  variable  splitting  be  performed  on  the  first  expression,  it 
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must  first  be  instantiated.  The  ♦X  must  be  replaced  by  (VECTOR 
♦ *X1)  and  XI  must  be  split  twice  yielding  (CLAES  (VECTOR  **'A2 
«.»X3  ♦■►X4)).  Semi-unification  will  now  find  tne  most  general 
unifier  of  the  two  expressions.  In  general,  in  applying  the 
first  method,  variables  must  be  instantiated  by  expressions  with 
new  fragment  variable  arguments  with  tne  same  type  as  the  type  of 
expressions  to  which  they  may  be  matched.  In  the  example,  ♦X  had 
to  be  instantiated  by  a vector  since  it  was  going  to  be  matched 
to  a vector. 

The  choice  of  method  is  dependent  on  the  expected  form  of 
expressions  to  be  unified  as  discussed  below. 

Theorem  3.2.  The  procedure  semi-unify  is  complete  if  at  least 
one  of  the  input  expressions  contains  no  variables  (the  procedure 
semi-unify  is  complete  for  pattern  matching) . 

^£22l*  Induction  will  again  be  performed  on  the  tuple  (nj^,n2) 
where  n^  is  the  number  of  distinct  variables  occurring  in  the 
expressions  and  n2  is  the  total  number  of  elements  occurring  in 
the  expressions  excluding  those  already  matched  as  elements  of 
cl  asses . 

(nj^,n2)*  (0, 0)  . If  n2*0,  then  neither  expression  has  any 
(unmatched)  elements.  Tnus,  each  must  be  an  empty  vector,  an 
empty  bag,  or  a class  with  all  elements  matched.  If  they  are  not 
both  empty  vectors,  not  both  empty  bags,  or  not  both  classes  with 
all  elements  matched,  then  the  empty  set  of  substitutions  is 
returned  (semi-unify  steps  5b,  6b,  7b).  Such  pairs  of 
expressions  have  no  unifier  and  thus  semi-unify  is  complete  for 
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tnis  case.  If  they  are  both  empty  vectors,  both  empty  bags,  or 
both  classes  with  all  elements  matched  (i.e.,  identical  classes), 
the  null  substitution  is  returned  as  the  only  unifier  (semi-unify 
step  1).  Semi-unify  is  complete  for  this  case  since  every 
unifier  is  an  instance  of  the  returned  null  substitution. 

Assume  semi-unify  is  complete  for  all  ( n^ , 02)  < (m^^  ,m2 ) • Prove 
it  is  complete  for  (02,02)=  (mj^ , m2)  . Assume  (02,02)=  (m^  , m2)  for 
two  expressions.  Each  of  the  input  expressions  must  be  a 
constant,  simple  variable,  vector,  bag,  or  class. 

1.  If  both  expressions  are  constants,  either  the  empty  set  of 
substitutions  is  returned  (semi-unify  step  4)  or  the  null 
suostitution  is  returned  as  the  only  unifier  (semi-unify  step  1) . 
Semi-unify  is  complete  in  the  first  case  since  non- identical 
constants  have  no  unifier.  Semi-unify  is  complete  in  the  second 
case  since  every  unifier  is  an  instance  of  the  returned  null 
substitution . 

2.  If  one  expression  is  a constant  and  the  other  is  a simple 
variable,  the  substitution  of  the  constant  for  tne  variable  is 
returned  as  the  only  unifier  (semi-unify  steps  2a,  3a).  The 
substitution  of  the  constant  for  the  variable  is  the  only  unifier 
of  the  two  expressions,  so  semi-unify  is  complete  for  this  case. 

3.  If  both  expressions  are  simple  variables,  either  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1) 
or  the  substitution  of  the  second  variable  for  the  first  is 
returned  as  the  only  unifier  (semi-unify  step  2a) . In  the  first 
case,  semi-unify  is  complete  since  every  unifier  is  an  instance 
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of  the  returned  null  substitution.  in  the  second  case, 
semi-unify  is  complete  since  the  returned  substitution  of  one 
variable  for  the  other  is  maximally  general  with  the  two 
instantiated  variables  equal  to  one  of  the  original  variables. 

4.  If  one  expression  is  a constant  and  the  other  is  a vector, 
bag,  or  class,  the  empty  set  of  substitutions  is  returned 
(semi-unify  step  4).  Semi-unify  is  complete  in  this  case  since 
such  pairs  of  expressions  have  no  unifier. 

5.  If  one  expression  is  a simple  variable  and  tne  other  is  a 
vector,  bag,  or  class,  either  the  substitution  of  the  vector, 
bag,  or  class  for  the  variable  is  returned  as  the  only  unifier 
(semi-unify  steps  2bl , 2cl , 2dl,  3bl,  3cl,  3dl ) or  the  vector, 
bag,  or  class  is  unified  with  a vector,  bag,  or  class  with  new 
fragment  variable  elements  substituted  for  the  simple  variable 
argument  (semi-unify  steps  2b2,  2c2,  2d2,  3b2,  3c2,  3d2).  In  the 
first  case,  semi-unify  is  complete  since  the  returned 
substitution  for  the  variable  by  the  other  expression  is 
maximally  general  with  the  two  instantiated  expressions  equal  to 
one  of  the  original  expressions.  The  second  case  cannot  occur  by 
the  assumption  that  only  one  of  the  expressions  nas  any 
variables . 

6.  If  both  expressions  are  vectors,  bags,  or  classes,  but  not 
both  of  the  same  type,  the  empty  set  of  substitutions  is  returned 
(semi-unify  steps  5b,  6b,  7b).  In  this  case,  semi-unify  is 
complete  since  such  pairs  of  expressions  have  no  unifier. 

7.  If  both  expressions  are  vectors,  either  (a)  the  null 
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substitution  is  returned  as  the  only  unifier  (semi-uniiy  step  1), 

(b)  the  empty  set  of  substitutions  is  returned  (semi-unifyvcctor 
steps  lb,  2b)  , (c)  two  identical  elements  are  removed  and  the 
remaining  subvectors  are  semi-unified  ( semi-unifyvector  step  3), 

(d)  suostitutions  are  made  for  a fragment  variaole  and  the 
instantiated  remaining  subvectors  are  semi-unified 
{semi-unifyvector  steps  la,  2a,  4a,  4b,  5a),  or  (e)  two 
expressions  are  semi-unified  and  the  instantiated  remaining 
subvectors  are  semi-unified  (semi-unifyvector  step  5b). 

7a.  In  subcase  a,  semi-unify  is  complete  since  every  unifier 
is  an  instance  of  the  returned  null  substitution. 

7b.  In  subcase  b,  semi-unifyvector  is  complete  since  such 
pairs  of  expressions  have  no  unifier. 

7c.  In  subcase  c,  semi-unifyvector  is  called  recursively  with 
expressions  with  measure  (Xj^,X2)  with  X2^<n2  and  X2<n2-2.  Thus, 
since  { X2,X2)< (n2fn2) * by  the  inductive  assumption  this  recursive 
call  is  complete.  Every  unifier  of  two  vectors  with  identical 
first  elements  is  a unifier  of  the  two  vectors  with  identical 
first  elements  removed.  Therefore,  semi-unifyvector  is  complete 
for  this  subcase. 

7d.  In  subcase  d,  semi-unifyvector  is  recursively  called  with 
expressions  with  measure  (n2-l,x)  for  some  x.  Thus,  since 
(n^-l ,x)< (n2»n2)  for  all  x,  by  the  inductive  assumption  this 
recursive  call  is  complete.  The  assignments  to  the  fragment 
variable  are  complete  since  (1)  every  possible  assignment  of  an 
initial  subvector  for  the  fragment  variable  is  performed  since 
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there  are,  by  assumption,  no  (fragment)  vari abler  in  the  opposing 
vector  and  (2)  the  substitution  of  subvectors  for  fragment 
variables  are  maximally  general.  Therefore,  the  compositions  of 
substitutions  of  initial  subvectors  for  the  fragment  variable  and 
returned  unifiers  for  the  remaining  vectors  are  a complete  set  of 
unifiers  for  the  vectors  and  semi-unif yvector  is  complete  for 
this  subcase. 

7e.  In  subcase  e,  semi-unify  is  called  recursively  with 
expressions  with  measure  (xj^,X2)  with  X3^<n2  and  X2<n2.  If  called 
at  all  (if  semi-unify  I a;b]  succeeds),  semi-unif  yvector  is  called 
recursively  with  expressions  with  measure  (x^,X2)  with  for 

some  X2  (since  if  the  first  elements  are  unifiaole  but  not 
identical  at  least  one  variable  must  have  been  substituted  for) . 
Thus,  by  the  inductive  assumption,  each  recursive  call  is 
complete.  Therefore,  the  compositions  of  unifiers  of  the  first 
elements  and  unifiers  of  the  remaining  vectors  are  a complete  set 
of  unifiers  for  the  vectors  and  semi-unifyvector  is  complete  for 
this  subcase. 

8.  If  both  expressions  are  bags,  either  (a)  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1) , 
(b)  the  empty  set  of  substitutions  is  returned  ( semi-unif ybag 
steps  lb,  2b),  (c)  two  identical  elements  are  removed  and  the 
remaining  subbags  are  semi-unified  ( semi-uni f ybag  step  3),  (d) 
substitutions  are  made  for  a fragment  variable  and  the 
instantiated  remaining  subbags  are  semi-unified  { semi-unif ybag 
steps  la,  2a,  4a,  4b,  5a),  or  (e)  two  expressions  are 
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semi-unified  and  the  instantiated  remaining  subbagc  are 
serai-unified  (semi-unif ybag  step  bb) . 

da.  In  subcase  a,  serai-unify  is  complete  since  every  unifier 
is  an  instance  of  the  returned  null  substitution. 

8b.  In  subcase  b,  semi-uni f ybag  is  complete  since  such  pairs 
of  expressions  have  no  unifier. 

8c.  In  subcase  c,  semi-unif ybag  is  called  recursively  with 
expressions  with  measure  (X2,X2)  with  and  X2<n2-2.  Thus, 

since  inductive  assumption  this  recursive 

call  is  complete.  Every  unifier  of  two  bags  with  identical 
elements  is  a unifier  of  the  two  bags  with  identical  elements 
removed.  Therefore,  semi-unif ybag  is  complete  for  this  subcase. 

8d.  In  subcase  d,  semi-unif ybag  is  recursively  called  with 
expressions  with  measure  (n^-lfX)  for  some  x.  Thus,  since 
(n^-l  ,x)  < (n2,n2)  for  all  x,  by  the  inductive  assumption  this 
recursive  call  is  complete.  The  assignments  to  the  fragment 
variable  are  complete  since  (1)  every  possible  assignment  of  a 
subbag  for  the  fragment  variable  is  performed  since  there  are,  by 
assumption,  no  (fragment)  variables  in  the  opposing  bag  and  (2) 
the  substitution  of  subbags  for  fragment  variables  are  maximally 
general.  Therefore,  the  compositions  of  substitutions  of  subbags 
for  the  fragment  variable  and  returned  unifiers  for  the  remaining 
bags  are  a complete  set  of  unifiers  for  the  bags  and 
semi-uni f ybag  is  complete  for  this  subcase. 

8e.  In  subcase  e,  semi-unify  is  called  recursively  with 
expressions  with  measure  (X2,X2)  with  x^^n^  and  X2<n2.  If  called 
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at  all  (if  semi-unif y ( a ;b]  succeeds),  semi-unifybag  is  called 
recursively  with  expressions  with  measure  (X2,X2)  with  X2<n2  for 
some  X2  (since  if  the  elements  are  unifiable  but  not  identical  at 
least  one  variable  must  have  been  substituted  for).  Thus,  by  the 
inductive  assumption,  each  recursive  call  is  complete.  In 
addition,  the  first  element  of  the  first  bag  is  allowed  to  be 
matched  against  every  element  of  the  second  bag.  This  is  a 
complete  set  of  matches  for  the  first  element  of  the  first  bag. 
Therefore,  the  compositions  of  unifiers  of  the  matched  elements 
and  unifiers  of  the  remaining  bags  are  a complete  set  of  unifiers 
for  the  bags  and  semi-unifybag  is  complete  for  this  subcase. 

9.  If  both  expressions  are  classes,  either  (a)  the  null 
substitution  is  returned  as  the  only  unifier  (semi-unify  step  1) , 
(b)  one  of  two  identical  elements  in  a single  class  is  removed 
and  the  remaining  subclasses  semi-unified  ( semi-unif yclass  steps 
1,  2),  (c)  two  identical  elements  are  matched  against  each  other 
and  the  remaining  subclasses  are  semi-unified  (semi-unifyclass 
step  3),  (d)  substitutions  are  made  for  a fragment  variable  and 
the  instantiated  remaining  subclasses  are  semi-unified 
(semi-unifyclass  steps  5a,  5b,  6a),  or  ( e)  two  expressions  are 
semi-unified  and  the  instantiated  remaining  subclasses  are 
semi-unified  (semi-unifyclass  step  6b) . Tne  last  two 
possibilities  may  occur  after  a one  time  interchange  of  the  two 
classes  (semi-unifyclass  step  4). 

9a.  In  subcase  a,  semi-unify  is  complete  since  every  unifier 
is  an  instance  of  the  returned  null  substitution. 
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9b.  In  subcase  b,  semi-unif yclass  is  called  recursively  with 
expressions  with  measure  (Xj^,x2)  with  Xi<n^  and  X2<n2-1.  Thus, 
since  ( Xj. » *2  J ^ inductive  assumption  this  recursive 

call  is  complete.  Every  unifier  of  two  classes  with  repeated 
elements  in  a class  is  a unifier  of  the  two  classes  witn  all  such 
elements  but  one  in  each  class  removed.  Therefore, 
semi-unifyclass  is  complete  for  this  subcase. 

9c.  In  subcase  c,  semi-unifyclass  is  called  recursively  with 
expressions  with  measure  {X2^,X2)  witn  xj^<nQ^  and  X2<n2-1.  Thus, 
since  ( x^ ,X2 )< ( n^, n2) » by  tne  inductive  assumption  this  recursive 
call  is  complete.  Every  unifier  of  two  classes  with  identical 
elements  is  a unifier  of  the  two  classes  with  identical  elements 
removed.  Therefore,  semi-unifyclass  is  complete  for  this 
subcase . 

9d.  In  subcase  d,  semi-unifyclass  is  recursively  called  with 

i 

expressions  with  measure  (nj^-l.x)  for  some  x.  Thus,  since  | 

(n^-l ,x)<  (n2,n2)  for  all  x,  by  the  inductive  assumption  this 

recursive  call  is  complete.  The  assignments  to  the  ‘fragment 

variable  are  complete  since  (1)  every  possible  assignment  of  a 

subclass  for  the  fragment  variable  is  performed  since  there  are, 

by  assumption,  no  (fragment)  variables  in  the  opposing  class  and 

(2)  the  substitution  of  subclasses  for  fragment  variables  are 

maximally  general.  Therefore,  the  compositions  of  substitutions 

of  subclasses  for  the  fragment  variable  and  returned  unifiers  for 

the  remaining  classes  are  a complete  set  of  unifiers  for  the 

classes  and  semi-unifyclass  is  complete  for  this  subcase. 
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9e.  In  subcase  e,  semi-unify  is  called  recursively  with 


expressions  with  measure  (Xj^,X2)  with  Xj^<n-^  and  X2<n2.  If  called 
at  all  (if  semi-uni£y(a;b]  succf-cds)  , scmi-unifyclass  is  called 
recursively  with  expressions  with  measure  (Xj^,X2)  with  xj^<ni  for 
some  X2  (since  if  the  first  elements  are  unifiable  but  not 
identical  at  least  one  variable  must  have  been  substituted  for) . 
Thus,  by  the  inductive  assumption,  each  recursive  call  is 
complete.  In  addition,  the  first  element  of  the  first  class  is 
allowed  to  be  matched  against  every  element  of  the  second  class, 
i.e.,  every  element  of  the  original  second  class  since  even 
previously  matched  elements  are  retained.  This  is  a complete  set 
of  matches  for  the  first  element  of  the  first  class.  Therefore, 
the  compositions  of  unifiers  of  the  matched  elements  and  unifiers 
of  the  remaining  classes  are  a complete  set  of  unifiers  for  the 
classes  and  semi-unif yclass  is  complete  for  this  subcase. 

Thus,  subject  to  the  inductive  assumption  that  semi-unify  is 
complete  in  the  case  where  at  least  one  of  the  input  expressions 
contains  no  variables  for  all  expressions  with  measure 
(n2 » 02)  < (nij^  ,m2  ) » semi-unify  is  complete  for  all  expressions  at 
least  one  of  which  contains  no  variables  with  measure 
(n2,n2)*  (ini,m2)  • 0 

The  proof  presented  here  is  actually  a little  stronger  (in  a 
difficult  way  to  state  or  use)  than  that  needed.  The  assumption 
that  one  of  the  expressions  has  no  variables  is  only  used  in 
steps  5,  7d,  8d , and  9d  of  the  proof;  it  is  not  used  in  some 
other  steps  in  which  variables  could  be  in  both  expressions. 
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The  assumption  is  required  in  steps  16,  8d , and  9d  because 
semi-unification  does  not  permit  an  initial  subvector,  a subbag, 
or  a subclass  including  only  part  of  a fragment  variable  to  be 
assigned  to  a fragment  variable;  only  an  integral  number  of 
elements  can  be  assigned,  l.e.,  a -fragment  variable  in  the 
expression  opposing  the  fragment  variable  being  substituted  for 
must  be  either  totally  included  in  or  excluded  from  the 
substitution . 

This  theorem  is  adequate  for  proving  the  completeness  of  a 
primitive  form  of  semi-unification  and  elaboration.  The 
semi-unification  algorithm  is  complete  if  at  least  one  expression 
has  no  variables;  the  elaboration  process  produces  all  ground 
instances  of  one  of  the  expressions.  By  the  completeness  of 
semi-unification  in  the  restricted  case,  and  the  exhaustiveness 
of  elaboration,  semi-unification  plus  elaboration  is  complete. 

However,  this  form  of  elaboration  is  primitive  and 
inefficient.  Me  choose  to  use  a more  powerful  elaboration 
process  (variable  splitting).  The  result  of  variable  splitting 
is  so  general,  retaining  all  the  kinds  of  elements  as  the 
original  expression,  and  retaining  -fragment  variables  in 
particular,  that  semi-unification  applied  to  two  expressions,  one 
modified  by  variable  splitting,  is  incomplete  in  general. 
Nevertheless,  semi-unification  plus  variable  splitting  is 
complete  since  unifiers  not  returned  by  semi-unification  after  a 
sequence  of  variable  splitting  operations  will  be  returned  by 
semi-unification  after  a different  sequence  of  variable  splitting 
operations. 
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lil*  The  unifiers  returned  by  semi-unification  of  two 
expressions  are  instances  of  unifiers  returncu  by 
semi-unification  of  generalizations  of  the  two  expressions  where 
individual  constants  or  simple  variables  are  replaced  by  fragment 
variables . 

Proof.  The  point  of  this  theorem  is  that  anything  you  can  do 
with  constants  or  simple  variables  in  terms  of  unification  you 
can  do  with  a fragment  variable.  Note  that  this  theorem  is  not 
true  where  the  generalization  associates  a fragment  variable  with 
a sequence  of  simple  variables  or  constants  of  length  greater 
than  one;  if  this  were  so,  semi-unification  alone  would  be 
complete  and  variable  splitting  would  be  unnecessary. 

A simple  variable  or  constant  can  be  matched  in  three  ways.: 

(1)  it  can  be  matched  against  another  simple  variable  or 
constant,  (2)  it  can  be  included  in  the  value  of  a fragment 
variable,  or  (3)  it  can  be  an  element  of  an  expression  matched  to 
a simple  variable  or  included  in  the  value  of  a fragment 
variable . 

.In  the  first  case,  a generalization  where  the  simple  variable 
or  constant  is  replaced  by  a fragment  variable  could  be 
successfully  matched  by  matching  the  fragment  variable  to  the 
other  simple  variable  or  constant. 

■In  the  second  and  third  cases,  a generalization  where  the 
simple  variable  or  constant  is  replaced  by  a fragment  variable 
could  be  successfully  matched  in  precisely  the  same  way  as 
before . 

234 


'1 


Therefore,  no  unifiers  are  lost  by  the  use  of  -fragment 
variables  rather  than  simple  variables  or  constants.  0 

The  use  of  the  procedure  semi-unify  with 
variable  splitting  applied  to  instances  of  one  of  the  expressions 
is  complete  where  the  instances  are  formed  by  replacing  simple  or 
fragment  variables  by  vectors,  bags,  or  classes  with  new  fragment 
variable  arguments. 

££22^*  True  by  the  completeness  of  semi-unification  for 
pattern  matching,  i..e.,  the  completeness  of  serai-unification 
where  at  least  one  of  the  expressions  has  no  variables,  and  the 
previous  theorem  that  generalizing  simple  variables  or  constants 
to  fragment  variables  results  in  the  loss  of  no  unifiers.  The 
procedure  is  just  doing  elaboration  on  one  of  the  expressions 
except  instead  of  producing  ground  expressions,  its  output  is 
expressions  with  fragment  variables  which  could  represent 
constants.  0 

The  procedure  semi-unify  is  complete  if  neither 
of  the  input  expressions  has  any  fragment  variables. 

f£22£*  The  proof  is  essentially  the  same  as  that  for  the 
earlier  theorem  stating  the  completeness  of  serai-unification  for 
pattern  matching,  i .e. , the  completeness  of  semi-unification  in 
the  case  where  at  least  one  expression  has  no  variables. 

The  only  differences  are  (1)  all  references  to  fragment 
variables  (cases  7d,  8d , 9d)  may  be  omitted  and  (2)  the  second 
part  of  case  5 (the  expression  being  matched  to  a simple  variable 
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since  there 


containing  the  variable)  may  now  occur.  In  case  5, 
are  no  fragment  variables  in  the  two  expressions,  the  occurrence 
of  a variable  in  the  opposing  expression  in  case  b must  be  a 
simple  variable  occurrence.  But  such  an  expression  cannot  oe 
unified  with  the  simple  variable,  and  thus  semi-unify  is  complete 
for  this  case.  The  introduction  of  fragment  variables  in  this 
case  (by  semi-unify  steps  2b2,  2c2,  2d2,  3b2,  3c2,  3d2)  presents 
no  difficulty  since  (by  soundness)  the  unification  will  fail,  and 
the  introduced  fragment  variables  will  not  be  available  for 
future  recursive  calls  to  violate  the  assumption  of  the 
expressions  net  having  any  fragment  variables.  □ 

3^6.  The  use  of  the  procedure  semi-unify  with 
variable  splitting  applied  to  both  expressions  is  complete. 

Proof.  By  the  previous  theorem,  semi-unification  is  complete 
for  expressions  having  no  fragment  variables.  Thus,  replacing 
fragment  variables  in  both  input  expressions  by  sequences  of 
distinct  simple  variables  plus  semi-unification  is  complete  due 
to  the  completeness  of  semi-unification  for  the  altered 
expressions  and  the  exhaustiveness  of  the  replacement  of  fragment 
variables  by  sequences  of  simple  variables  (we  are  just 
elaborating  both  input  expressions  to  expressions  with  no 
fragment  variables  by  replacing  them  by  sequences  of  simple 
variables)  . 

By  the  theorem  that  the  unifiers  returned  by  semi-unification 
of  two  expressions  are  instances  of  unifiers  returned  by 
semi-unification  of  generalizations  of  the  two  expressions  where 


236 


individual  constants  or  simple  variables  are  replaced  by  fragment 
variables,  replacing  fragment  variables  in  botn  input  expressions 
by  sequences  of  distinct  fragment  (rather  than  simple)  variables 
plus  semi-unification  is  complete.  i3ut  such  seouences  may  be 
generated  by  applying  variable  splitting,  so  semi-unification 
plus  the  application  of  variable  splitting  to  both  expressions  is 
complete,  u 

We  now  have  two  alternative  methods  of  extending 
semi-unification  to  completeness  by  use  of  variable  splitting. 
Some  choice  must  be  made  of  whether  to  implement  the  first 
method,  the  second  method,  or  both  methods. 

If,  for  example,  it  is  known  that  all  the  expressions  that 
will  be  matched  against  each  other  are  one  level  deep  (contain  as 
elements  only  fragment  variables  and  atomic  expressions)  , then  it 
is  sensible  to  implement  only  the  -first  method  since  no 
instantiation  would  be  required  and  the  use  of  variable  splitting 
on  the  second  expression  would  be  redundant. 

On  the  other  hand,  if  it  is  known  that  the  expressions  that 
will  be  matched  against  each  other  will  usually  be  quite  complex 
with  many  levels,  the  required  instantiation  of  the  first  method 
may  be  too  inconvenient  and  costly  to  perform.  In  this  case,  the 
second  method  would  probably  be  preferred. 

A third  alternative  is  to  implement  botn  methods  and 
dynamically  choose  which  to  employ  when  presented  with  two 
expressions  to  be  matched.  For  the  completely  general  case  where 
there  is  no  a priori  reason  to  expect  expressions  of  one  kind  as 
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opposed  to  that  of  another,  this  is  probably  the  best  alternative 
permitting  the  more  efficient  use  of  variable  splitting  in  the 
separate  cases. 


3.13.  Vector  unification 

.It  has  already  been  stated  several  times  that  (unless  some 
extended  notation  for  unifiers  is  adopted)  the  number  of  unifiers 
in  the  general  vector  unification  case  may  be  infinite. 

For  expository  purposes,  we  present  here  a vector  unification 
aloorithm  which,  although  possibly  non- terminating  even  in  cases 
where  there  are  no  unifiers,  is  sound  and  complete. 

This  vector  unification  algorithm  illuminates  the  role  of 
variable  splitting  in  extending  serai-unification  to  completeness, 
and  is  a model  for  the  class  unification  algorithm  to  be 
presented  later. 

The  unifyvector  procedure,  when  presented  with  two  non-empty 
vectors,  always  removes  the  first  element  of  the  first  vector  at 
each  step. 

When  presented  with  two  vectors  whose  first  elements  are  not 
fragment  variables,  it  directly  unifies  the  first  elements  and 
then  unifies  the  instantiated  remainders  of  the  vectors.  When 
one  or  both  of  the  first  elements  of  the  vectors  are  fragment 
variables,  the  substitution  of  the  empty  vector  for  the  fragment 
variable  and  a general  substitution  including  the  opposing  first 
element  for  the  fragment  variable  are  both  tried. 


Steps  are  numbered  in  close  correspondence  to  the  numbering  in 
the  vector  semi-unification  algorithm  for  comparison  purposes. 

1.  If  X is  of  the  form  (VECTOR)  and 

a.  y is  of  the  form  (VECTOR  --Y  8)  then  let  0= {Y - (VECTOR) ) 
and  return  { ea  1 o € semi-unify ( x;  (VECTOR  8)0]). 

b.  y is  not  of  the  form  (VECTOR  ♦*Y  8)  then  return  {). 

2.  If  y is  of  the  form  (VECTOR)  and 

a.  X is  of  the  form  (VECTOR  ♦•X  o)  then  let  0*  {X-^  (VECTOR)  ) 
and  return  {00  I o c semi-uni fy ( (VECTOR  a)0;  yl}. 

b.  X is  not  of  the  form  (VECTOR  ♦♦X  o)  then  return  {}. 

3.  If  X is  of  the  form  (VECTOR  a a)  and  y is  of  the  form 
(VECTOR  a 8)  for  arbitrary  element  a then  semi-unify( (VECTOR 
a) ; (VECTOR  0) ] . 

4.  If  X is  of  the  form  (VECTOR  ♦♦X  a)  then 

a.  if  y is  of  the  form  (VECTOR  ♦♦Y  8)  then  let  0={Y»  (VECTOR 
**X  ♦♦Y')}  and  return  (00  I a e unify ( (VECTOR  a)0;  (VECTOR 
♦•Y ' 8)0]}  and 

b.  if  y is  of  the  form  (VECTOR  b 8)  and  X is  not  contained  in 
b then  let  0»{X* (VECTOR  b ♦♦X*)}  and  return  {00  I 0 c 
unify ( (VECTOR  —X'  a)0;  (VECTOR  8)0]}  and 

c.  if  y is  of  the  form  (VECTOR  b b)  where  element  b is  not  a 
fragment  variable  let  0*{X*  (VECTOR) } and  return  I0O  I o c 
unify ( (VECTOR  a)0;  y0)}. 

5.  If  X is  of  the  form  (VECTOR  a o)  where  element  a is  not  a 
fragment  variable  then 
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a.  y is  of  the  form  (VECTOR  **Y  B)  and  Y is  not  contained  in 
a then  let  e={Y*(VECTOR  a ♦♦Y'))  and  return  {00  I a e 
unify  I (VECTOR  o)0;  (VECTOR  — Y'  8)0)}  and 

b.  if  y is  of  the  form  (VECTOR  b 8)  wnere  element  b is  not  a 
fragment  variable  then  return  {00  I 0 c unify(a;  b]  a a € 
unify  I (VECTOR  o)0;  (VECTOR  8)0)}  and 

c.  if  y is  of  the  form  (VECTOR  **Y  8)  then  let  0={Yv (VECTOR) } 
and  return  l0o  | a e unifylxB;  (VECTOR  8)0]). 

Notes : 

1.  If  one  of  the  vectors  is  the  empty  vector  and  the  other  is 
not,  the  latter  must  be  composed  solely  of  fragment  variables 
wnich  must  be  bound  to  the  empty  vector.  Otherwise,  there  is  no 
unifier  (steps  1,  2). 

2.  If  the  first  elements  of  two  vectors  are  identical,  the 
remainders  of  the  vectors  are  unified  (step  3). 

3.  If  the  first  element  of  one  or  both  of  the  vectors  is  a 
fragment  variable,  the  substitution  for  the  fragment  variable  of 
the  vector  consisting  of  the  opposing  first  element  and  a new 
fragment  variable  is  performed  (meaning  the  fragment  variable 
includes  the  opposing  first  element)  and  the  remainders  of  the 
vectors  are  unified  (steps  4a,  4b,  5a). 

4.  If  the  -first  element  of  one  (but  not  both)  of  the  vectors 
is  a fragment  variable,  the  substitution  of  the  empty  vector  for 
the  fragment  variable  is  performed  and  the  remainders  of  the 
vectors  are  unified  (steps  4c,  5c). 

5.  If  neither  of  the  first  elements  of  the  vectors  is  a 
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fragment  variable,  the  two  first  elements  are  unified  against 
each  other  and  the  remainders  of  the  vectors  are  unified  for  each 
unifier  of  the  first  elements  (step  5b). 

This  algorithm  can  readily  be  made  substantially  more 
efficient.  One  major  cause  of  inefficiency  is  that  every  time  a 
fragment  variable  is  matched  against  a non-fragment  variable,  a 
new  fragment  variable  is  formed  (steps  4b,  5a).  Actually,  it 
would  be  sufficient  to  match  the  fragment  variable  against  an 
entire  sequence  of  non-fragment  variables  similarly  to 
semi-unification.  Thus,  when  matching  (VECTOR  ♦♦X  a)  and  (VECTOR 
A B **Y) , the  substitutions  {X*(VECTOR) } , {X-(VECTOR  A)},  and 
{X*(V£CTOR  A B ♦♦Xl ) } would  be  tried  in  parallel  rather  than  the 
successive  { X -■  (VECTOR)  } , {X*(VECTORA  ♦*Xl)},  {X  * (VECTOR  A ) , 

XI -►(VECTOR)  ) , and  {X-(VECTOR  A B *-X2),  XI -(VECTOR  3 ♦*X2)}. 

The  algorithm  is  expressed  in  this  simpler,  less  efficient 
form  so  it  can  serve  as  a template  for  a later  description  of  the 
similar  class  unification  algorithm  for  which  similar  fixes  are 
difficult  to  describe  and  the  less  efficient  form  is  therefore 
presented . 

The  algorithm  can  readily  be  shown  to  be  sound  using  a proof 
similar  to  the  soundness  proof  for  (vector)  semi-unification. 

Assuming  the  completeness  of  the  unify[a;  b]  call  in  step  5b, 
the  algorithm  is  clearly  complete  (capable  of  generating  all 
unifiers) . There  is  no  difficulty  in  establishing  completeness 
in  an  induction  proof  where  neither  first  element  is  a fragment 
variable.  In  the  critical  cases  where  either  first  element  is  a 
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fragment  variable,  the  algorithm  is  exhaustive,  trying  both 
cases:  the  opposing  element  is  or  is  not  included  in  the 
substitution  for  the  fragment  variable. 


One  little  trick  in  the  algorithm  is  the  elimination  of  the 
substitution  of  the  empty  vector  for  a fragment  variable  first 
element  if  the  opposing  first  element  is  also  a fragment 
variable.  This  is  justifiable  (doesn't  result  in  loss  of 
completeness)  since  the  possibility  is  implicitly  included  in  the 
substitution  for  the  opposing  fragment  variable. 

For  example,  in  unifying  (VECTOR  ♦♦X  a)  and  (VECTOR  ♦♦Y  B), 
the  untried  substitution  {X* (VECTOR))  (and  unification  of  (VECTOR 
o)  and  (VECTOR  ♦♦Y  B))  is  an  instance  (with  variable  renaming  (Y' 
for  Y))  of  the  substitution  {Y*(VECTOR  •►♦X  ••►Y')}  (and 
unification  of  instances  of  (VECTOR  a)  and  (VECTOR  ♦♦Y'  B))  which 
is  used.  The  trick  merely  postpones  the  trial  assignment  of  the 
empty  vector  to  the  -fragment  variable  until  the  fragment  variable 
again  appears  first  in  a vector,  and  the  opposing  vector  is 
either  empty  or  has  a first  element  which  is  not  a fragment 
variable . 

The  reason  this  algorithm  does  not  always  terminate  is  that  a 
fragment  variable  may  be  matched  against  the  first  element  of  the 
opposing  vector  spawning  a new  fragment  variable  indefinitely. 
Technically,  the  semi-unification  termination  proof  breaks  down 
when  applied  to  this  algorithm  because  the  measure  (nj^,n2)  of 
variaoles  and  elements  does  not  always  decrease  when 
substitutions  are  made  for  fragment  variables. 
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An  example  of  the  application  of  this  algorithm  is  the 
unification  of  (VECTOR  -♦X  A)  and  (VECTOR  A ♦*X) . A truncated 
trace  follows. 

unifyvector I (VECTOR  --X  A);  (VECTOR  A •-X) ] 

1 unifyvector [ (VECTOR  A);  (VECTOR  A)1 
I return  { {X*  (VECTOR)  } } 

1 unifyvector I (VECTOR  ♦♦XI  A);  (VECTOR  A ♦♦XI)] 

I I unifyvector I (VECTOR  A) ; (VECTOR  A)] 

I I return  { {X^  (VECTOR  A ) ; Xl^  (VECTOR)  } } 

I I unifyvector ( (VECTOR  ♦♦X2  A);  (VECTOR  A ♦♦X2)] 

I I I unifyvector [ (VECTOR  A) ; (VECTOR  A)] 

I I I return  { {X^  (VECTOR  A A) , XI ♦ (VECTOR  A ) , X2^ (VECTOR) } } 

I I I unifyvector [ (VECTOR  ♦♦xa  A);  (VECTOR  A ♦♦XS)] 


A pathological  example  of  the  application  of  this  algorithm  is 
the  unification  of  (VECTOR  ♦♦X  B)  and  (VECTOR  A ♦♦X)  which  has  no 
unifier.  The  algorithm  does  exactly  as  before,  substituting  -for 
X either  the  empty  vector  or  the  vector  consisting  of  A and  a new 
fragment  variable  at  each  step.  However,  in  this  case 
unification  of  (VECTOR  B)  and  (VECTOR  A)  fails  and  the 
unification  algorithm  finds  no  unifier  (it  is,  after  all,  sound) 
and  never  terminates. 

The  use  of  a finite  number  of  variable  splitting  operations 
and  the  application  of  the  semi-unification  algorithm  effectively 
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imposes  a limit  on  the  number  of  times  fragment  variables  are 
permitted  to  be  matched  and  form  new  fragment  variables. 

Variable  splitting  externalizes  and  bounds  the  variable  splitting 
wnich  occurs  inside  the  vector  unification  algorithm. 


3.14.  Bag  unification 

.In  the  case  of  bags,  there  does  exist  a complete  unification 
algorithm. 

.In  discussing  this  algorithm,  we  will  assume  that  the  two  bags 
have  no  common  elements.  This  presents  no  difficulty  since  if 
two  bags  do  have  common  elements,  no  unifiers  are  lost  and 
efficiency  is  gained  if  common  elements  are  eliminated  from  the 
bags  in  pairs  until  no  common  elements  remain. 

The  bag  unification  algorithm  will  be  expressed  in  terms  of  an 
algorithm  for  the  complete  unification  of  bags  composed  only  of 
fragment  variables.  The  result  of  unifying  such  bags  is  an 
assignment  to  each  variable  of  the  bags  a bag  composed  of  some 
sequence  of  elements.  For  such  an  assignment  to  be  a unifier, 
the  only  requirement  is  that  for  each  element  used  in  any 
assignment  there  are  the  same  number  of  occurrences  of  that 
element  in  each  of  the  instantiated  bags.  For  example,  in 
unifying  (BAG  •♦X  **X  ♦♦X  **Y  *-Y)  and  (BAG  ♦♦U  ♦♦U  —U  **U  ♦♦U 
♦ ♦U  **U  ♦♦V  ♦♦V  ♦♦V)  , if  element  a is  part  of  some 

assignment  to  one  of  the  variables  X,  Y,  .U,  or  V,  then  3 times 
the  number  of  occurrences  of  a in  the  assignment  for  X plus  2 
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times  the  number  of  occurrences  of  a in  the  assignment  for  Y must 
equal  7 times  the  number  of  occurrences  of  a in  the  assignment 
for  U plus  5 times  the  number  of  occurrences  of  a in  the 
assignment  for  V. 

With  each  pair  of  bags  composed  only  of  fragment  variaoles  is 
associated  a single  equation  representing  the  number  and 
multiplicity  of  variables  in  each  bag.  For  example,  the  ecuation 
3x+2y  » 7u+5v  is  associated  with  tne  pair  of  bags  given  above. 
Tnis  equation  succintly  represents  the  condition  for  a 
substitution  to  be  a unifier;  that  the  sura  of  the  number  of 
occurrences  of  any  element  a in  each  variable  multiplied  by  the 
multiplicity  of  the  variable  in  the  bag  must  be  equal  for  the  two 
bags. 

Non-negative  integral  solutions  to  such  equations  can  be  used 
to  represent  unifiers.  The  solutions  must  be  non-negative  since 
each  variable  in  a bag  must  be  assigned  a non-negative  number  of 
occurrences  of  each  element;  it  cannot,  for  example,  nave  -1 
occurrences  of  the  constant  A.  Likewise,  the  solutions 
considered  must  be  integral  since  fractions  of  elements  are  not 
permitted  in  variable  assignments.  We  will  now  concentrate  on 
finding  an  adequate  set  of  non-negative  integral  solutions  to 
such  equations. 

Let  us  consider  the  equation  3x+2y  « 7u+5v.  Solutions  will  be 
expressed  in  terms  of  a 4-tuple.:  (a,b,c,d)  where  a,  b,  c,  and  d 
are  all  non-negative  integers  and  3a+2b»7c+5d,  i.e.,  a,  o,  c.  and 
d are  non-negative  integers  which,  when  assigned  to  variables  x, 
y,  u,  and  v respectively,  result  in  the  equation  being  satisfied. 
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Immediately  discoverable  solutions  to  the  eouotion  are 


(7, 0,3,0),  (5, 0,0, 3),  (0,7, 2,0).,  and  (0,5, 0,2)  obtained  by  taking 
pairs  of  variables  from  the  two  sides  of  the  equation,  setting 
al-l  the  other  variables  to  zero,  and  giving  the  least 
non-negative  integral  solution  (excluding  the  trivial  solution  of 
all  zeroes).  Other  solutions  are  possible  such  as  (4, 1,2,0)  and 
any  sum  of  known  solutions.  What  is  needed  is  a finite  set  of 
solutions  which  expresses  the  set  of  all  solutions. 

If  we  were  concerned  with  all  real  solutions  to  the  equation, 
these  could  be  obtained  by  taking  linear  combinations  of  any 
m+n-1  (where  m is  the  number  o-f  variables  in  the  left  side  of  the 
equation,  n is  the  number  of  variables  in  the  right  side  of  the 
equation)  linearly  independent  solutions  of  the  equation,  m+n-1 
linearly  independent  solutions  are  included  among  the  m*n 
solutions  generated  by  taking  pairs  of  variables  from  the  two 
sides  and  solving  the  resulting  equation  after  setting  all  the 
other  variables  to  zero. 

This  is  not  adequate  for  generating  all  the  non-negative 
integral  solutions  by  addition  alone  however  (we  are  concerned 
with  finding  solutions  generated  as  sums  to  satisfy  the 
non-negativity  and  integrality  constraints).  For  example,  the 
solution  (4, 1,2,0)  is  not  the  result  of  adding  any  number  of  the 
pairwise  solutions  together.  Thus,  any  set  of  m+n-1  linearly 
independent  solutions  must  be  augmented  by  additional 
non-negative  integral  solutions  so  that  all  such  solutions  may  be 
generated . 
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h’e  hypothesize  here  that  every  non-negative  integral  solutions 
can  be  formed  by  addition  alone  (or  eouival ently , by 
multiplication  oy  a positive  integer  and  addition)  from  the  set 
of  all  non-negative  integral  solutions  for  which  the  value  of  the 
left  side  of  the  eauation  (equals  the  right  side  of  the  equation, 
and  is  called  the  value  of  the  eauation)  is  less  than  or  equal  to 
the  maximum  of  the  least  common  multiples  of  the  coefficients  of 
a variable  on  the  left  side  and  a variable  on  the  right  side. 

The  value  of  the  equation  3x+2y  ® 7u+5v  is  21  (3*7).  The  set 
of  all  non-negative  integral  solutions  to  tne  equation  with  value 
less  than  or  equal  to  21,  excluding  (0,0, 0,0)  (which  is  generable 
as  the  empty  sum  of  solutions)  and  solutions  composable  from 
other  solutions  with  value  less  than  or  equal  to  21  is  (1,1, 0,1), 

(1.2. 1.0) ,  (0,5, 0,2),  (4, 0,1,1),  (0,6, 1,1),  (4, 1,2,0),  (0,7, 2,0), 
(5, 0,0, 3),  and  (7, 0,3,0).  According  to  the  hypothesis,  all 
non-negative  integral  solutions  to  the  equation  are  composable  by 
addition  from  this  set  of  9 solutions. 

Now  let  us  return  to  the  original  bag  unification  problem 
which  spawned  this  particular  equation.  Each  solution  represents 
a solution  to  the  original  bag  unification  problem.  For  example, 

(4. 1.2.0)  represents  the  solution  {X*(BAG  ♦♦T  •♦T  ♦♦T  ♦♦T) , 
y*-(BAG  ♦♦T)  , U*  (BAG  ♦♦T  ♦♦T)  , V*(BAG)]  for  arbitrary  new  variable 
T and  all  its  instances.  Thus  we  see  that  each  solution  can 
represent  a unifier  of  the  bags  using  a particular  newly  formed 
fragment  variable. 

Since  fragment  variables  can  be  empty,  we  can  freely  compose  a 
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unifier  incorporating  all  the  solutions.  Let  us  do  so.  .Je  first 
associate  a new  fragment  variable  with  each  solution: 

(X,Y,.U,V) 

(1,1, 0,1)  ♦♦Tl 
(1,2, 1,0) 

(0,5, 0,2)  ♦♦Tl 
(4, 0,1,1)  ♦♦T4 

(0,6, 1,1)  ♦♦TS 
(4, 1,2,0)  ♦♦Te 
(0,7, 2,0)  ♦♦T? 

(5, 0,0, 3)  **T8 

(7, 0,3,0) 

Then  the  first  variable  is  assigned  a bag  composed  of  fragment 
variables  in  the  number  specified  by  the  first  components  of  the 
tuple,  the  second  variable  is  assigned  a bag  composed  of  fragment 
variables  in  the  number  specified  by  the  second  component,  etc. 
Thus,  X would  be  assigned  a bag  with  1 Tl,  1 T2,  4 T4's,  4 T6's, 

5 T8's,  and  7 T9's.  The  final,  complete  unifier  is  {X*(BAG  ♦♦Tl 


♦ ♦T2 

♦ ♦T4 

♦ 

♦T4 

♦ ♦T4 

♦ ♦T4 

♦ ♦T6 

♦ ♦T6 

♦ ♦T6 

♦ ♦T6 

♦ ♦T8 

♦ ♦T8 

♦♦T8  ♦♦TB 

«.*T8 

♦ *T9 

♦ 

*T9 

♦ ♦T9 

♦ ♦T9 

♦ ♦T9 

♦ ♦T9 

♦ ♦T9) 

(BAG 

♦ ♦Tl  ♦♦T2  ♦♦T2 

♦ ♦T3 

♦ *T3 

♦ 

*T3 

**T3 

♦ ♦T3 

♦ ♦T5 

♦ ♦T5 

♦ ♦T5 

♦ ♦T5 

♦ ♦T5 

♦ ♦T5 

♦ ♦T6  ♦♦T7 

♦ *T7 

♦ ♦T7 

♦ 

*T7 

♦ ♦T7 

♦ ♦T7 

♦ ♦T7 

)*  r (BAG  ♦ 

♦T2 

♦ ♦T4 

♦♦15  **T6  ♦♦TS 

♦ ♦T7 

♦ ♦T7 

♦ 

*T9 

♦ ♦T9 

♦ ♦T9) 

(OAG  ♦ 

♦Tl  ♦ 

♦T3 

♦ ♦T3 

♦ ♦T4  ♦♦T5  ♦♦TB 

• ♦T8 

♦ *T8 

)}. 

• 

Each  i 

nstan 

tiated  bag 

has 

5 Tl 

•s,  7 

T2's, 

, 10  T3'S, 

12  T4's, 

12 

T5 

•s,  14 

T6 's 

, 14 

T7 's. 

15  T8’s, 

and 

21  T9' 

's,  110 

fragment  variable  occurrences  in  all. 
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The  important  point  is  that  the  single  unifier  presented  above 
is  the  roost  general  unifier;  every  unifier  of  the  two  bags  in  an 
instance  of  this  unifier. 

To  be  more  precise  in  the  definition  of  the  algorithm,  the 
algorithm  consists  of  the  following  steps.: 

1.  Form  an  equation  from  the  two  bags  wnere  the  coefficient  of 
each  variable  in  the  equation  is  equal  to  the  multiplicity  of 
the  corresponding  fragment  variable. 

2.  Generate  all  non-negative  integral  solutions  of  the  equation 
with  value  less  than  or  equal  to  the  maximum  least  common 
multiple  of  a coefficient  from  the  left  side  and  a coefficient 
from  the  right  side  of  the  equation. 

3.  Eliminate  all  solutions  composable  by  addition  from  other 
solutions.  Such  solutions  can  be  found  by  subtracting  from 
them  each  solution  with  lesser  value;  if  any  difference  is 
non-negative  integral,  the  solution  is  composable  from  other 
solutions  and  can  thus  be  rejected. 

4.  Associate  with  each  solution  a single  new  fragment  variable. 

5.  Assemble  a unifier  composed  of  assignments  to  the  original 
fragment  variables  of  bags  with  as  many  of  each  new  fragment 
variable  as  specified  by  the  solution  element  associated  with 
the  original  fragment  variable  and  the  new  fragment  variable. 
This  solution  must  be  generalized  in  some  way.  Although  the 

case  of  unifying  bags  composed  solely  of  fragment  variables  is 
important,  it  is  not  sufficient. 

There  does  not  appear  to  be  any  straightforward  method  of 
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performing  operations  on  a pair  of  bags  to  eliminate  all  tne 
elements  which  are  not  fragment  variables  (generating  partial 
unifiers  in  the  process)  so  that  the  above  algorithm  is 
applicable  to  the  preprocessed  input. 

One  possible  method  is  to  merely  perform  all  the  possible 
unifications  involving  elements  which  are  not  fragment  variables 
first.  For  example,  consider  unifying  (BAG  ♦♦X  ♦♦Y)  and  (BAG  A 
♦♦w) . Tne  constant  A must  be  an  element  of  either  X or  Y. 
Thus,  {X»(BAG  A ♦♦Tl)}  and  {Y»(BAG  A ♦♦T2) ) (for  new  fragment 
variables  T1  and  T2)  are  legitimate  partial  unifiers  of  the  two 
oags  representing  respectively  that  X contains  A with  remainder 
Tl  and  Y contains  A with  remainder  T2.  Tne  instantiated  bag 
pairs  with  matched  elements  removed  are  then  (BAG  ♦♦Tl  ♦♦Y)  and 
(BAG  ♦♦Z  ♦♦vO  , and  (BAG  ♦♦X  ♦♦T2)  and  (BAG  ♦♦Z  ♦♦W)  to  which  the 
above  algorithm  could  be  applied. 

However,  this  method  quickly  breaks  down.  Consider  the  bags 
(BAG  A ♦♦X  ♦♦X)  and  (BAG  ♦♦Y  ♦♦Y)  which,  incidentally,  have  no 
unifier  since  the  first  bag  clearly  must  have  an  odd  number  of 
A's  and  the  second  bag  must  have  an  even  number  of  A's.  We  make 
the  substitution  {Y*(BAG  A ♦♦Tl) ) , the  only  possible  substitution 
since,  if  the  expressions  are  unifiable,  A must  be  an  element  of 
Y.  After  instantiating  the  expressions  and  removing  the  matched 
elements,  the  bags  (BAG  ♦♦X  ♦♦X)  and  (BAG  ♦♦Tl  A ♦♦Tl)  which, 
given  the  commutativity  of  the  bag  data  type  and  the 
commutativity  of  the  unification  operation,  poses  a unification 
problem  identical  to  tne  original  one.  Thus,  some  other  approach 
is  needed. 
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The  most  straightforward  approach  which  works  is  the 
following.:  from  an  arbitrary  pair  of  bags  to  be  unified  form  a 

new  pair  of  bags  in  which  each  element  in  the  original  pair  of 
bags  which  is  not  a fragment  variable  is  replaced  by  a new 
fragment  variable.  For  example,  the  bags  (BAG  ♦♦X  •►♦Y)  and  (BAG 
♦ «.S  «.*Z  ♦♦vv)  would  be  formed  from  the  bags  (BAG  ♦♦X  ♦♦Y)  and  (GAG 
A **2  by  replacing  each  occurrences  of  A by  new  fragment 

variable  S. 

Now  solve  the  equation  associated  with  the  new  pair  of  bags. 
For  this  problem,  the  associated  equation  (x+y=s+z+w)  has  the 
following  set  of  solutions  and  associated  fragment  variables: 

(X,Y,S,Z,W) 

(1,0, 1,0,0)  **T1 

(1,0, 0,1,0)  **T2 

(1,0, 0,0,1)  **T3 

(0,1, 1,0,0)  ♦♦T4 

(0,1, 0,1,0)  ♦♦TS 
(0,1, 0,0,1)  .*T6 

The  resulting  unifier  is  thus 

{X*(BAG  ♦♦Tl  -*T2  **T3)  , Y-(BAG  ♦♦T4  **T5  **T6)  , 

S*(BAG  ♦♦Tl  ♦*T4)  , Z-(BAG  **T2  ♦♦TS)  , W--(BAG  --TZ  --*T6)}. 

Tne  final  step  of  the  algorithm  is  to  unify  in  every  possible 
way  tne  elements  replaced  by  fragment  variables  (after 
instantiating  them  by  the  returned  unifier  of  the  derived  bags  in 
case  they  contain  occurrences  of  variables  the  same  as  the 
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unified  fragment  variables)  with  the  values  of  the  fragment 
variables  returned  by  the  algorithm  for  bags  composed  entirely  of 
fragment  variables.  This  unification  presents  no  difficulty 
since  one  of  the  expressions  is  guaranteed  to  have  no  fragment 
variables  at  the  top  level  by  construction;  thus, 
semi-unification  will  suffice  here.  In  the  example,  the  value  of 
3 is  unified  with  (BAG  A).  This  results  in  unifiers  {Tl*(BAG  A), 
T4*(BAG)}  and  {Tl*  (BAG)  , T4*(BAGA)}. 

Thus  the  final  set  of  unifiers  for  the  original  two 
expressions  consists  of  {X-(BAG  A «--.-T2  ■►•••T3)  , y»(BAG  ♦♦T5  ♦♦T6)  , 
Z«-(BAG  ♦*T2  ♦.♦TS)  , W*(BAG  —TB  -♦To)}  and  {X-(BAG  ♦♦T2  — T3)  , 
y»(BAG  A ♦■.-T5  ♦♦To),  2^(BAG  ♦♦T2  —T5)  , W^(BAG  ♦♦TB  ♦♦T6)}. 

Another  example  of  the  use  of  this  bag  unification  algorithm 
is  the  unification  of  (BAG  ♦X  ♦Y)  and  (BAG  A B) . 

Replacing  each  non-fragment  variable  element  of  the  bags  by  a 
new  fragment  variable,  we  obtain  (BAG  ♦♦SI  ♦♦32)  and  (BAG  ♦♦SS 
♦♦34).  The  values  of  31,  32,  33,  and  34  will  later  be  matched  to 
(BAG  ♦X)  , (BAG  ♦Y)  , (BAG  A),  and  (BAG  6)  respectively. 

The  corresponding  equation  is  S2+S2®S3+S4  with  solutions  and 
corresponding  newly  created  fragment  variables 

(31,32,33,34) 

(1,0, 1,0)  ♦♦Tl 

(1,0, 0,1)  ♦-T2  , 

(0,1, 1,0)  ♦♦T3 

(0,1,0,!)  ♦*T4. 

Thus,  the  general  unifier  of  (BAG  ♦♦Si  ♦♦32)  and  (BAG  ♦♦S3 
♦♦S4)  is 

{Sl^  (BAG  ♦♦TJ.  ♦♦T2)  , 

32^ (BAG  ♦♦T3  ♦♦T4)  , 

33^ (BAG  ♦♦Tl  ♦♦T3)  , 

34^  (BAG  ♦♦T2  ♦♦T4)  } . 
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Unifyinq  the  value  of  SI  ((BAG  -*‘11 


■T2))  with  (BAG  ♦X)  , we 


obtain 


{ (SI  ♦(BAG 

-X)  , 

S2  ♦(BAG 

♦ ♦T3  ♦♦T4), 

S3  ♦(BAG 

♦X  ♦♦T3)  , 

S4  <.(BAG 

♦ ♦T4)  , 

T1  ♦(BAG 

♦X)  , 

T2»(flAG) 

} t 

(SI  ♦(BAG 

♦X)  , 

S2»(BAG 

♦ ♦T3  ♦♦T4)  , 

S3  ♦(BAG 

♦ ♦T3)  , 

S4  ♦(BAG 

♦X  ♦♦T4)  , 

T1  ♦(BAG) 

9 

T2^(BAG 

♦X) }}  . 

Unifying  the  value  of  S2  ((BAG  •►•►T3  ♦♦T4))  with  (BAG  ♦Y)  , we 
Obtain 


{ (SI 

♦ (BAG 

♦X)  , 

.S2 

♦ (BAG 

♦Y)  , 

.S3 

♦ (BAG 

♦X  ♦Y) 

S4 

♦ (BAG) 

9 

T1 

♦ (BAG 

♦X)  , 

T2 

♦ (BAG) 

9 

T3 

♦ (BAG 

♦Y)  , 

T4 

♦ (BAG) 

(SI 

♦ (BAG 

♦X)  , 

S2 

♦ (BAG 

♦Y)  , 

S3 

♦(BAG 

-X)  , 

S4 

♦(BAG 

*Y)  , 

T1 

♦(BAG 

-X)  , 

T2 

♦(BAG) 

9 

T3 

♦(BAG) 

9 

T4 

♦(BAG 

♦Y)  ) , 

(SI 

♦(BAG 

*X)  , 

S2 

♦(BAG 

•Y)  , 

S3 

♦(BAG 

•Y)  , 

.S4 

♦(BAG 

♦X)  , 

T1 

♦(BAG) 

9 

T2 

♦(BAG 

♦X)  , 

T3 

♦ (BAG 

*Y)  , 

T4 

♦(BAG) 

(SI 

♦(SAG 

♦X)  , 

S2 

♦(BAG 

♦Y)  , 

S3 

♦(BAG) 

9 

J4 

♦(BAG 

♦X  ♦Y) 

T1 

♦(BAG) 

9 

T2 

♦ (BAG 

-X)  , 

T3 

♦ (BAG) 

9 

T4 

♦ (BAG 

•Y) }}  . 
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Unifying  the  values  of  S3  witn  (BAG  A),  we  obtain 


{ {SI*  (BAG  A)  , 
S2*  (BAG  *Y)  , 
.S3*  (BAG  A)  , 
.S4*  (BAG  *Y)  , 
Tl*  (BAG  A)  , 
T2*  (BAG)  , 

T3* (BAG)  , 

T4*  (BAG  *Y)  , 
X*A } t 

{SI*  (BAG  -X)  , 
S2* (BAG  A) , 
.S3*  (BAG  A)  , 
S4*  (BAG  *X)  , 
Tl* (BAG) , 

T2*  (BAG  *X)  , 
T3*  (BAG  A)  , 
T4*  (BAG)  , 
Y*A} } . 


Unifying  the  values  of  S4  with  (DAG  B) , we  obtain 

{ {Sl*(aAG  A) , 

S2*  (BAG  B)  , 

S3*  (BAG  A)  , 

S4*(BAG  B)  , 

Tl*  (BAG  A)  , 

T2* (BAG)  , 

T3* (BAG)  , 

T4*  (BAG  B)  , 

X*A, 

Y*B)  , 

{SI*  (BAG  B)  , 

S2* (BAG  A) , 

33* (BAG  A) , 

S4*  (BAG  B)  , 

Tl* (BAG) , 

T2*  (BAG  B)  , 

T3*  (BAG  A)  , 

T4* (BAG)  , 

Y*A, 

X*3)  } 

or,  removing  the  now  useless  newly  formed  fragment  variables, 
{{X*A,  Y*B}  , {X*B,  Y*A} } , just  what  was  expected. 


Lemma  3.^^. 

(Xj^,...,  X|p » y 


Every  non-negative  integral  solution 
*.,yp)  to  the  eouation  a jXj^ + . . . +a^Xjp  = 
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biyi+. . . +bnyn  with  positive  integral  coefficients 

ajjj  rbj^ .....  . is  an  additive  linear  combination  of 

non-negative  integral  solutions  with  value  a^x^ + . . . +ajpXjj, 

(=biyi+. • . +bnyn)  less  than  or  equal  to  the  maximum  of  m and  n 
times  the  maximum  of  the  least  common  multiples  of  pairs  of 
numbers  one  from  a2,..,,ajj,  and  one  from  bj^ , . ,bj^. 

'^£22£*  Assume  with  no  loss  of  generality  that  the  least 
common  multiple  (-Icm)  of  a^  and  b^  is  the  maximum  of  the  least 
common  multiples  and  that  m>n. 

iProof  by  induction  on  the  value  of  a solution  k. 
k=0.  The  solution  with  k=0  with  X2=0,  ...»  yi“^»  •••' 

v„=G  is  aenerable  as  the  additive  linear  combination  of 
non-negative  integral  solutions  with  value  less  than  or  equal  to 
m*lcm(a2»bj^)  with  zero  coefficients. 

Assume  the  lemma  is  true  for  every  non-negative  integral 
solution  with  value  less  than  or  eoual  to  k.  Prove  it  is  true 
for  k . 

Case  1.  k £ m^lcmCa^fb]^)  . .In  this  case,  the  solution  is 
included  among  the  non-negative  integral  solutions  with  value 
less  than  or  ecual  to  m*lcm(a2,b]^)  and  the  lemma  is  true. 

Case  2.  k > m*  1cm  ( a^ , bj^ ) . Since  a2Xj^+.  . . +ajj^Xjjj  * k > 
m*lcm  ( a^jbj ) , and  each  aj^x^>0,  at  least  one  must  be  greater 

than  lcm(a2,bj^),  and  x^  must  be  greater  than  1cm  ( a^  ,bj^ ) /a^  . 
Similarly,  since  *3iyi+*  • • +bj^yj^  * k > m*lcm(aj^,bj^)  , and  each 
bjyj>0,  and  n<m,  at  least  one  bjyj  must  be  greater  than 
1cm  ( aj^ ,b2^ ) , and  y^  must  be  greater  than  1cm  ( a^^ , bj^ ) /bj  . Consider 
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the  solution  with  x^=lcm  ( a ,b  j ) /a  j , yj  = lcm  { a^  ,b  j ) /bj  , and  all 
other  variables  zero.  This  is  just  the  solution  in  lowest  terms 
involving  only  Xj^  and  y^  and  has  value  Icm  ( a ,b  j ) < Icm(a2fb2). 
Since  Icm  ( a^  r bj^ ) /a  1cm  ( a^  ,b  j ) /a^  and  -Icm  ( a^ , bj^ ) /bj> 

1cm  ( a^  ,b  j ) /bj  by  the  maximality  of  IcmCa^fb^^),  the  second 
solution  involving  only  Xj^  and  yj  can  be  subtracted  from  the 
first  leaving  a non-negative  integral  solution  as  result.  But 
this  difference  solution  has  value  k->lcm  ( a^  ,b  j ) <k  and  is  thus 
composable  from  solutions  with  value  less  than  or  equal  to 
m*  Icra  ( a^ » ) . Therefore,  the  solution  with  value  k>m*  1cm  ( a^.bj^ ) 
is  the  sum  of  some  solution  involving  only  Xj^  and  yj  with  value 
less  than  or  equal  to  lcm(a]^,bj^)  and  some  other  set  of  solutions 
with  value  less  than  or  equal  to  m*  1cm  ( a^ ,bj^ ) and  the  lemma  is 
true  for  this  case.  U 

The  lemma  proves  an  upper  bound  on  solution  values  that  must 
be  examined  in  the  determination  of  a complete  set  of 
non-negative  integral  solutions  which  span  the  non-negative 
integral  solution  space  by  addition.  However,  the  proven  bound 
is  greater  than  the  previously  conjectured  bound  for  which  we  do 
not  have  a proof.  Although  a proof  for  the  tighter  bound  would 
be  desirable,  it  should  be  noted  that  a lower  proven  bound  does 
not  reduce  the  number  of  found  solutions  theoretically  necessary 
(or,  equivalently,  the  number  of  introduced  fragment  variables), 
but  only  decreases  the  cost  of  computing  them.  This  is  true 
since  any  additional  solutions  discovered  using  a higher  bound 
than  necessary  must  be  composable  from  solutions  bounded  by  any 
lower  proven  bound  and  would  therefore  be  omitted. 
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•It  should  also  he  noted  that  the  conjectured  bound  is  a lower 
bound  on  solution  values  which  must  oe  examined,  i.e.,  solutions 
with  at  least  this  value  need  to  be  examined.  This  is  because 
one  of  the  needed  solutions  not  otherwise  generable  is  the 
pairwise  solution  involving  only  the  variables  with  those  two 
coefficients  with  the  maximum  least  common  multiple  and  having 
value  equal  to  the  conjectured  bound. 

The  solutions  to  examples  presented  here  are  complete,  and  the 
needed  equation  solutions  were  discovered  within  the  lower 
conjectured  bound  and  no  additional  (not  otherwise  composaole) 
solutions  were  discovered  within  the  higher  proven  bound. 

Theorem  3^7.  The  unification  algoritnm  for  oags  composed 
entirely  of  fragment  variables  always  terminates,  is  sound,  and 
is  complete. 

Proof.  The  algorithm  is  guaranteed  to  terminate  since  it 
performs  a finite  number  of  operations  on  the  finite  number  of 
non-negative  integral  solutions  generated  from  the  equation 
corresponding  to  the  two  bags.  The  generation  of  these  solutions 
is  finite  due  to  the  trial  solution  values  being  bounded. 

The  algorithm  is  sound  since  each  solution  of  the  derived 
equation  causes  the  introduction  into  each  of  the  instantiated 
bags  of  an  equal  number  of  new  fragment  variable  occurrences. 
Thus,  the  two  instantiated  bags  have  the  same  number  of 
occurrences  of  each  new  fragment  variable  and  are  therefore 
unified  . 

Let  k be  the  cardinality  of  the  set  of  elements  of  the  unified 


257 


bags  for  any  solution  to  the  unification  of  two  bags  composed 
entirely  of  fragment  variables.  The  two  instantiated  bags  must 
have  an  equal  number  of  occurrences  of  each  element.  That  is, 

^lCil  + **-+a^„Ci„  = ‘'l‘^il  + --*+bndin  (l<i<k)  where 
aj^,  . . . , a,j  ,bj^ , . . . ,bj^  are  the  multiplicities  of  the  fragment 
variables  in  the  bags  as  before,  Cj^j  is  the  number  of  occurrences 
(in  the  unifying  substitution)  of  element  i in  fragment  variable 
j from  the  first  bag,  and  d^j  is  the  number  of  occurrences  of 
element  i in  fragment  variable  j from  the  second  bag. 

Each  tuple  ( c . . . ,c  ,d  ^ . . . ,d  is  a solution  to  the 

equation  +. . . +ajjjX,j|=bj^y . . +b^yj^  corresponding  to  the  two 
bags.  It  can  thus  (according  to  the  lemma)  be  formed  as  the  sum 
of  certain  non-negative  integral  solutions  to  the  equation 
weighted  by  positive  integers. 

Include  in  the  value  of  the  new  fragment  variable  associated 
with  each  of  these  solutions  a number  of  occurre  .ces  of  element  i 
equal  to  the  coefficient  of  the  solution  in  the  weighted  sum. 

This  will  result  in  the  assignment  of  c^j  occurrences  of  element 
i to  each  fragment  variable  j of  the  first  bag  and  d^j 
occurrences  of  element  i to  each  fragment  variable  j of  the 
second  bag. 

Do  this  for  each  of  the  k elements  in  the  solution.  Let  no 
other  or  additional  elements  be  included  in  the  values  of  the  new 
fragment  variables.  In  particular,  assign  the  empty  bag  to  tnose 
fragment  variables  not  used  above. 

This  assignment  of  elements  in  the  solution  to  new  fragment 
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d 


I 


variables  associated  with  equation  solutionc  generated  in  tr.e 

V 

unification  process  results  in  the  correct  number  c--  or  a.  of 

1 j 1 j 

each  element  being  assigned  each  fragment  variable  of  the 
original  two  bags. 

Tnus,  any  solution  to  the  unification  of  two  bags  is  an 
instance  of  the  returned  unifier  and  the  algorithm  is  complete. 


The  final  theorem  for  bag  unification  relates  to  its 
termination,  soundness,  and  completeness  in  the  general  case.  We 
cannot  prove  termination,  soundness,  and  completeness  for  a 
completely  arbitrary  selection  of  elements  of  the  two  bags  if 
such  results  are  not  proven  for  unification  of  the  elements  among 
themselves  directly.  This  is  really  no  restriction  on  the 
algorithm's  usefulness;  no  more  powerful  result  could  be  proved 
for  any  bag  unification  algorithm.  For  example,  we  cannot  expect 
to  completely  unify  (BAG  (VECTOR  A ■>— X))  and  (BAG  (VECTOR  --X  A)) 
just  because  the  vectors  are  included  in  bags  presented  to  a 
"complete"  bag  unification  algorithm  if  we  cannot  completely 
unify  (VECTOR  A *»X)  and  (VECTOR  **X  A)  directly. 

What  the  theorem  states  is  that  the  algorithm  introduces  no 
additional  non- termination , unsoundness,  or  incompleteness.  For 
example,  the  semi-unification  algorithms  terminate,  are  sound, 
and  are  complete  for  expressions  containing  no  fragment 
variables.  Tnus,  bag  unification  for  bags  composed  of  fragment 
variables  and  fragment  variable  free  elements  terminates,  is 
sound,  and  is  complete  when  using  semi-unification  in  the 
recursive  calls  to  unification  algorithms. 
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JliSSE®!!!  3^8.  The  bag  unification  algorithm  always  terminates, 
is  sound,  and  is  complete  (provided  unification  of  elements  is). 

Let  a and  b be  any  two  bags  being  unified.  Let  a*  and 
b*  be  the  bags  resulting  from  replacing  elements  which  are  not 
fragment  variables  by  new  fragment  variables,  a*  and  b*  are 
generalizations  of  a and  b respectively,  i.e.,  a*e=a  and  b*6=b 
for  some  0 of  the  form  { . . . ,x^-(BAG  yj^)  , . . . } where  each  is  a 
new  fragment  variable  and  each  y^  is  the  expression  in  a or  b it 
replaces  in  a or  b . 

Let  o be  the  unifier  of  a*  and  b*  returned  by  the  unification 
algorithm  for  bags  composed  entirely  of  fragment  variables. 
According  to  the  previous  theorem,  bag  unification  terminates,  is 
sound,  and  is  complete  for  bags  composed  entirely  of  fragment 
variables. 

Let  0 ' be  e with  substitution  o applied  to  each  variable  value 
of  0.  This  operation  is  necessary  to  cover  the  possibility  that 
some  element  of  the  bags  includes  an  occurrence  of  a variable 
also  occurring  as  a fragment  variable  element  of  the  bags  which 
is  thus  assigned  a value  in  a. 

is  an  instance  of  a and  b*e'  is  an  instance  of  b. 
Nevertheless,  every  unifier  of  a and  b is  also  a unifier  of  a*0' 
and  b*0'  since  0 (by  which  9 was  instantiated  to  obtain  9')  is 
the  most  general  unifier  of  generalizations  a*  and  b*  of  a and  b. 

We  now  have  (by  virtue  of  the  instantiation  of  9 by  a)  two 
substitutions  (9*  and  a)  for  variables  of  a*  and  b*  neither  of 
which  includes  in  variable  values  any  variable  being  substituted 
for  in  eitner  substitution. 
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Every  unifier  of  a and  b is  a simultaneous  instance  of  e*  and 
0.  Thus,  all  unifiers  can  be  generated  by  unifying  in  all 
possible  ways  variable  values  in  e*  and  a corresponding  to  the 
same  variables. 

Let  be  the  variables  substituted  for  in  6'  and  o and 

let  yi,...,yj^  and  be  the  corresponding  values  in  e*  and 

o respectively.  (Note  that  o contains  a substitution  pair  for 
every  fragment  variable  in  a*  and  b*;  6'  contains  substitution 
pairs  only  for  new  fragment  variables  introduced  in  the  formation 
of  a*  and  b* . For  variables  Xj^  substituted  for  in  a but  not  in 
e'  we  will  assume  y^  to  be  Xj^,  i.e.,  the  null  substitution  will 
be  used.) 

I C A ...  A € 

unify  ( ..  ;Z|^  •••  ®k-l  ^ ^ thus  the  list  of  all  unifiers 

of  a and  b.  The  algorithm  thus  terminates,  is  sound,  and  is 
complete  if  each  recursive  call  on  unify  terminates,  is  sound, 
and  is  complete. 

Let  us  examine  the  arguments  of  these  recursive  calls  more 
closely.  Each  y^  is  a bag  consisting  of  a single  element  which 
is  not  a fragment  variable  and  each  is  a bag  consisting  of 
fragment  variables.  Thus,  every  is  merely  an 

instance  of  y^^  having  a single  element  which  is  not  a fragment 
variable  and  every  aj^. . . is  a bag  consisting  of  fragment 

variables  and  instances  of  elements  of  Zj^, . . . , z^_j^ . 

.In  unifying  and  two  cases  are 

possible.  .In  the  first  case,  the  single  element  of 
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is  assigned  as  the  single  bag  element  of  a fragment  variable  of 


and  the  remainder  of  . .0  is  unified  with  the 

empty  bag.  This  case  can  be  handled  with  termination,  soundness, 
and  completeness  by  semi- uni f ication . In  the  second  case,  the 
element  of  yi®i--*®i_i  is  unified  against  an  element  of 

which  is  not  a fragment  variable  and  is  itself  an 
instance  of  some  Zj . Here,  the  assumption  of  the  theorem  is 
required  to  complete  the  argument  for  total  termination, 
soundness,  and  completeness  of  the  algorithm. 

Thus,  the  algorithm  always  terminates,  is  sound,  and  is 
complete  provided  pairs  of  elements  from  the  two  bags  can  be 
unified  with  termination,  soundness,  and  completeness.  Li 

3.15.  AC  unification 

We  present  here  an  algorithm  (called  the  AC  unification 
algorithm)  for  unifying  two  terms  whose  function  is  associative 
and  commutative.  Terms  will  be  represented  as  if  the  function 
had  an  arbitrary  number  of  arguments  with  no  superfluous 
parentheses . 

We  will  assume  that  the  argument  lists  of  the  two  terms  being 
unified  have  no  common  arguments.  This  presents  no  difficulty 
since  no  unifiers  are  lost  and  efficiency  is  gained  if  common 
arguments  are  eliminated  immediately.  This  is  done  by  removing 
common  arguments  a pair  at  a time,  one  from  each  of  the  argument 
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lists.  For  example,  before  unifying  f(xxyabc)  and  f(bbDcz),  the 
b's  common  to  the  two  terms  are  removed  yielding  f(xxyac)  and 
f(bbcz)  , and  the  c's  common  to  the  two  new  terms  are  removed 
yielding  f(xxya)  and  f(bbz).  An  example  of  the  utility  of 
immediately  removing  common  arguments  is  the  unification  of 
f(g(x)y)  and  f(g(x)g(a)).  If  the  g(x)'s  common  to  the  two  terms 
are  immediately  removed,  the  unification  algorithm  will  return 
the  most  general  unifier  {y-^Ca)}.  If  the  common  g(x)'s  are 
retained,  unification  will  likely  result  in  the  generation  of  the 
additional  less  general  unifier  { x ♦a  ,y  *g  ( a)  } . 

The  algorithm  will  be  expressed  partially  in  terms  of  an 
algorithm  for  the  complete  unification  of  terms  with  an 
associative  and  commutative  function  with  only  variables  as 
arguments.  The  result  of  unifying  such  terms  is  an  assignment  to 
each  variable  of  the  terms  some  sequence  of  terms.  Each  variable 
is  assigned  a term  t^  (whose  function  symbol  is  not  f)  or  a term 
f ( tj'^l. . . tjj,"ra)  (with  n^  occurrences  of  term  t^  as  arguments  of 
f)  . For  such  an  assignment  to  be  a unifier,  the  only  requirement 
is  that  for  each  term  t^  used  in  any  assignment  there  are  the 
same  number  of  occurrences  of  that  term  occurring  as  arguments  of 
f in  each  of  the  unified  terms  instantiated  by  the  assignment. 

For  example,  in  unifying  f ( Xj^X2X2 ) and  f(y]^y2^y2)»  if  term  t is 
part  of  some  assignment  to  one  of  the  variables,  then  2 times  the 
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times  the  number  of  occurrences  of  t in  the  assignment  for 
plus  the  number  of  occurrences  of  t in  the  assignment  for  y2- 
For  example,  {xj^*f(bb),  X2»f(ab),  x^^-a,  y^'^b,  y 2*f  ( aaobb)  } is  a 
unifier  of  f {xj^Xj^X2X2 ) and  f(y2yiy2)  since  there  are  2 a's  and  5 
b’s  in  the  instantiations  of  f ( xj^Xj^X2X3 ) and  f(yj^y]^y2)  with  the 
unified  term  being  f(aabbbbb). 

With  each  pair  of  terms  with  an  associative  and  commutative 
function  with  only  variable  arguments  is  associated  a single 
equation  representing  the  numoer  and  multiplicity  of  variables  in 
each  term.  For  example,  the  eouation  2x^+X2+X2  = 2y2+y2  is 
associated  with  the  pair  of  terms  given  above.  This  equation 
succintly  represents  the  condition  for  a substitution  to  be  a 
unifier:  that  the  sum  of  the  number  of  occurrences  of  any  term  in 
the  value  of  each  variable  multiplied  by  the  multiplicity  of  the 
variable  in  the  term  must  be  equal  for  the  two  terms. 

Non-negative  integral  solutions  to  such  equations  can  be  used 
to  represent  unifiers.  The  solutions  must  be  non-negative 
integral  since  each  variable  must  be  assigned  a non-negative 
integral  number  of  occurrences  of  each  term. 

In  order  to  generate  all  the  solutions  to  the  problem  of 
unifying  the  two  terms,  it  is  necessary  to  be  able  to  represent 
all  the  solutions  to  the  equation  derived  from  the  terms.  Every 
non-negative  integral  solution  to  the  equation  is  representable 
as  a sum  of  elements  of  a particular  -finite  set  of  non-negative 
integral  solutions  to  the  equation,  i.e. , every  non-negative 
integral  solution  to  the  equation  is  a sum  (equivalently,  a sum 
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with  non-negative  integral  weights)  of  elements  of  a particular 
finite  set  of  non-negative  integral  solutions.  The  finite  set  of 
non-negative  integral  solutions  by  whose  addition  the  entire 
non-negative  integral  solution  space  is  spanned  is  generable  by 
generating  in  ascending  order  of  value  solutions  to  the  equation, 
eliminating  those  solutions  composable  from  those  previously 
generated.  This  process  can  be  made  finite  by  placing  a bound  on 
the  maximum  solution  value  which  will  be  used;  such  a maximum  was 
proved  in  Lemma  3.1  to  eliminate  no  needed  solutions. 

Consider  the  equation  2xj^-»-x2+X3  ® 2y2^+y2.  Solutions  to  the 
equation  are: 


1 

2 

3 

4 

5 

6 
7 


^1  2£2  £3  i^l 

0 0 10 


0 10  0 


0 0 2 1 


0 111 
0 2 0 1 


1 0 


0 0 


10  0 1 


Y2  2sit3i2±£3  2^1+22 


111 
111 
0 2 2 
0 2 2 
0 2 2 
2 2 2 
0 2 2 


^1 

^2 

*3 

*4 

^5 

*6 

^7 


Associated  with  each  solution  above  is  a new  variable  (in  the 
rightmost  column) . The  assignment  of  as  many  occurrences  of  that 
variable  as  specified  in  the  solution  to  each  of  the  variables  of 
the  original  terra  results  in  a partial  solution  to  the 
unification  of  the  the  original  terms.  In  particular,  the 
assignment  of  2 occurrences  of  variable  Z3  to  X3  and  1 occurrence 
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to  results  in  an  eaual  number  of  occurrences  of  variable  in 
each  of  f(x^Xj^X2X^)  and  f(y2Y2)- 

every  non-negative  integral  solution  to  the  eauation  is  a 
(non-negative  integer  weighted)  sum  of  the  7 solutions  presented 
above,  i.e.,  every  solution  is  representable  as 

X3=zi+2z3+Z4,  y 3=Z3+z  4+z  5+Z7,  y2  = Z3+Z2+2z5  for  some 
non-negative  integral  values  of  Zj^, . . . ,Zq.  However,  not  every 
solution  to  the  equation  is  a solution  to  the  unification  problem 
for  which  the  equation  was  derived..  There  is  an  additional 
constraint  that  each  variable  of  the  original  terms  must  be  have 
at  least  one  term  in  its  value;  it  cannot  have  zero  terms  in  its 
val ue . 

Hence,  we  must  form  that  subset  of  the  2^*126  sums  for  wnich 
each  element  of  the  5-tuple  is  non-zero.  (It  is  not  necessary  to 
consider  sums  in  which  any  solution  has  a coefficient  other  than 
0 or  1 since  such  solutions  (in  the  unification  problem)  are 
already  representable  since  the  solution's  inclusion  with 
coefficient  1 introduces  a variable  which  can  have  as  its  value 
an  arbitrary  number  of  terms  as  arguments  of  f thus  simulating 
the  case  of  the  coefficient  being  greater  than  1.)  There  are  69 
such  sums  including  for  example  (representing  the  sum  by  the  set 
of  its  indices)  {2,3,6},  (1,2, 3, 6},  and  (4,6)  with  associated 
un  i f i e r s 

{^^♦Zg,  X2*Z2,  ' ^1*’^3'  ^ ^ ^ 2^  6^  6^  ^ ' 

l^l-Zg,  X2*Z2,  X3-f  ( Z2Z3Z3)  , y3''-Z3,  y2»f  ( Z3Z2ZgZg)  } , and 

(Xi-^Zg,  X2*z^,  X2*z^.  yi*Z4,  y2*f(ZgZg)}. 


Kote  that  if  a variable  could  hove  as  its  value  zero  term 


1 


rather  than  one  or  more  termr.  as  in  the  first  order  predicate 
calculus,  it  would  bo  unnecessary  to  form  this  subset  of  2^ 

(where  n is  tne  number  of  solutions)  sums.  Only  the  sum  of  all 
the  solutions  would  be  reauired  since  any  variable  present  in 
this  sum  could  have  value  zero,  and  the  variables  in  the 
corresponding  unifier  could  be  matched  against  zero  terms.  Tnis 
is  the  situation  with  fragment  variables  in  the  bag  data  type  in 
CA4  and  OLISP  [43,371  . 

To  be  more  precise  in  the  definition  of  the  algorithm,  the 
algorithm  consists  of  the  following  steps: 

1.  Form  an  equation  from  the  two  terms  where  the  coefficient  of 
each  variable  in  the  equation  is  equal  to  the  multiplicity  of  the 
corresponding  variable  in  the  term. 

2.  Generate  all  non-negative  integral  solutions  to  the  equation 
eliminating  all  those  solutions  composable  from  other  solutions. 

3.  Associate  with  each  solution  a new  variable. 

4.  For  each  sum  of  the  solutions  (no  solution  occurring  in  the 
sum  more  than  once)  with  no  zero  components  assemble  a unifier 
composed  of  assignments  to  the  original  variables  with  as  many  of 
each  new  variable  as  specified  by  the  solution  element  in  the  sum 
associated  with  the  new  variable  and  the  original  variable. 

Now  we  present  the  complete  algorithm  for  unifying  general 
terms  with  associative  and  commutative  functions  using  the 
algorithm  for  the  variable  only  case  above.  We  are  here 
concerned  with  terms  whose  function  is  associative  and 
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arguments  that  may 


commutative  with  arbitrary  arguments,  i.e., 
contain  ordinary  (non-associative , non-commutative)  functions  or 
f or  other  functions  which  are  associative  and  commutative.  We 
assume  the  presence  of  ordinary  unification  to  deal  with  those 
aspects  of  the  unification  problem  not  dealt  with  explicitly 
here . 

First,  when  unifying  two  terms,  two  new  terms  with  only 
variable  arguments  are  formed  by  uniformly  replacing  distinct 
arguments  by  new  variables.  Tnese  new  terms  have  only  variable 
arguments  and  are  generalizations  of  the  original  two  terms.  For 
example,  in  unifying  f(xxya)  and  f(bb2),  we  form  generalizations 
f(X2^X2X2X2)  and  f(y2yiy2)  with  substitution  { x^  ♦x , X2*y,  x^^a, 
yi»b,  y2*2}  instantiating  the  new  terms  to  the  original  terras. 

Next,  using  the  previous  algorithm  for  the  variable  only  case, 
we  unify  the  generalizations  of  the  original  terms.  This  has 
already  been  done  for  the  example  above  resulting  in  6i>  unifiers 
as  stated  previously. 

Now  we  have  the  generalizations  of  the  two  original  terms,  a 
substitution  to  instantiate  them  to  the  original  terms,  and  a 
complete  set  of  their  unifiers.  Every  unifier  of  the  original 
terms  is  a simultaneous  instance  of  the  substitution  to 
instantiate  the  generalizations  to  the  original  terms  and  a 
unifier  of  the  generalizations.  So  all  that  is  necessary  to  get 
all  the  unifiers  of  the  original  terms  is  to  unify  (for  each 
variable  being  substituted  for)  the  value  in  the  substitution  and 
the  value  in  the  unifiers. 
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In  the  example,  x-j  must  have  value  a and  must  have  volue  h. 
Thus,  any  unifier  of  f(X2XiX2X3)  and  f(yjyiy2)  which  assign.-  to 
X3  or  y2  a non-variable,  i.e.,  a term  of  the  form  f{...)  may  be 
immediately  excluded  from  consideration  since  the  unification  of 
it  with  the  assignment  including  X3 -a  and  y^-b  will  fail.  (This 
constraint  could  be  applied  during  the  generation  of  sums  of 
solutions  to  the  equation  rather  than  afterwards.)  This 
constraint  eliminates  63  of  the  69  unifiers,  leaving  sums  (1) 
{4,6},  (2)  {2,4,6},  (3)  {1,5,6},  (4)  {1,2, 5, 6},  (5)  {1,2,7},  and 
(6)  {1,2, 6, 7}  with  associated  unifiers 

(1)  X2*24,  X3*24,  y3'^4»  y2‘^^^^6^6^  ^ ' 

(2)  {Xj^-Zg,  X2»f(22Z4),  X2*Z4,  yi*24»  Y 2*^  ( ^2^  6^  6^  ^ ' 

(3)  lxi*Zg,  X2»f(Z5Z5),  X3*Z3,  y^-^Zs#  y 2 ♦f  ( 232  5^  6 ^ ^ ' 

(4)  {xj^-Zg,  X2 -f  ( Z2Z5Z5)  , X3*Z3,  y^^Zj,  y2*f  (Z3Z2ZgZg)  } , 

(5)  {xj^-z^,  X2*Z2»  X3*Z3,  Yi*'^-i>  Y ^ 

(6)  {Xj^ -f  ( zgz^)  , X2*Z2,  X3-Z3,  yx^z-^,  y2*f  ( z^z  2ZgZg)  } . 

Unifying  each  of  these  with  {x^-x,  X2*y,  X3^a,  y^-^b,  y2*z}  , we 
obtain 

(1)  no  unifier  since  24-3  and  Z4*b  are  not  unifiable, 

(2)  no  unifier  since  z^«-a  and  Z4*b  are  not  unifiable, 

(3)  {x-Zg,  yf(bb),  z*f(azg2g)}  (=  {y*f(bb),  z*f(axx)}), 

(4)  {x»Zg,  y»f(bbZ2),  2 ♦f  (a22ZgZg)  } (=  {y*f(bbZ2),  z *f  ( az2xx)  } ) , 

(5)  {x*b,  Y*^2’  z*f(a22)}  (*  {x-b,  z-f(ay))}),  and 

(6)  {x*f(b2g),  y22»  z-f  (a22Zg2g)  } (=  { x *f  { bzg)  , z *f  ( ayz  gZ  g)  } ) . 
This  is  a complete  set  of  unifiers  of  f (xxya)  and  f(bbz)  . 

Since  X3  and  y^  of  the  variable  only  case  correspond  to  a and 
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b respectively,  and  a and  b are  not  unifiable,  any  sum  including 
solution  4 to  the  equation  2x2^+X2+X2  = 2yj^+y2  can  be  excluded 
from  consideration  since  it  would  require  (as  in  (1)  and  (2) 
above)  the  unification  of  a and  b.  As  with  the  constraint  on 
variables  corresponding  to  non-variable  terms  not  being  assigned 
more  than  one  variable  (terms  of  the  form  f(....))  in  the  variable 
only  case,  this  latter  constraint  on  solutions  can  be  applied 
during  the  generation  of  unifiers  in  the  variable  only  case 
rather  than  afterwards.  Elimination  of  solution  4 before 
generation  of  the  2^  sums,  and  elimination  of  suras  which  do  not 
meet  the  first  constraint  would  result  in  the  formation  only  of 
unifiers  (3),  (4),  (5),  and  (b)  of  the  variable  only  case,  each 
of  which  has  a corresponding  unifier  in  the  general  case. 

More  precisely,  the  algorithm  consists  of  the  following  steps: 

1.  corm  generalizations  of  the  two  terms  replacing  each  distinct 
argument  by  a new  variable. 

2.  use  the  algorithm  for  the  variable  only  case  to  generate 
unifiers  for  the  generalizations  of  the  two  terms.  The  variable 
only  case  algorithm  may  be  constrained  to  eliminate  the 
generation  of  unifiers  assigning  more  than  one  term  to  variables 
whose  value  must  be  a single  term,  and  the  generation  of  unifiers 
Which  will  require  the  later  unification  of  terms  which  are 
obviously  not  unifiable. 

3.  Unify  for  each  variable  in  the  substitution  from  step  1 and 
the  unifiers  from  step  2 the  variable  values  and  return  the 
resulting  assignments  for  variables  of  the  original  terms.  This 
is  a complete  set  of  unifiers  of  the  original  terms. 
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We  will  first  establish  the  validity  of  eliminating  arguments 
common  to  the  two  terms.  This  will  be  done  by  proving  that  any 


unifier  of  the  terms  is  a unifier  of  the  terms  with  a pair  of 
common  arguments  removed  and  vice  versa. 

bet  Sj^,  . . . ,Sjj,  ,t2»  • f tp  be  terms  with  Sj^*tj  for 
some  i,j..  Let  e be  a unifier  of  and  Ht-^.  , and 

let  o be  a unifier  of  b ( Si- • • s s^^^)  and 

f : t^. . .tj_^tj^2 • • * Then  (1)  6 is  a unifier  of 

f (s  . .s^_j^s^^2^ Sj^)  and  f ( t . . t j_j^t  . . . t^^)  , and  (2)  o is  a 

unifier  of  f(S2...Sj^)  and  f(t2...t^).. 

iProof . 

1.  f 9f  (sj^. . . .Sjj^)  e)  = f (sj^. . .Sj^)  e = f ( tj^. . .t^^)  e 

= f ( tj  ef  { t^. . .t  j_j^t  . . .tj^)  9)  , and  Sj^9=tj9.  Therefore 

f (S]^.  . .s^_j^Sj,  . . .Sjj,)  9 * f ( t^.  . .t  j_^t  — ^n^  ® 9 is  a 

unifier  of  f ( s^.  . . ...Sj,,)  and  f ( t^ . . • t t . ...  t^,)  . 

2.  ^f®l’**®i  — l®i+l***®ni)  V ~ l^j+l***^n^  ® and 

Sj^o=tja.  Therefore  f ( s^  cf  ( s . . .s^^)  o)  = f(Sj^...s^)a  = 

f{t2...tf^)o  » f ( t j of  ( t^- . • t j_2^t  j + 2^ . . . tj^)  o)  and  o is  a unifier  of 
f(S2...Sjn)  and  f{t2...tn).  D 

Theorem  3j_10.  The  AC  unification  algorithm  for  terms  with 
associative  and  commutative  function  with  only  variables  as 
arguments  always  terminates,  is  sound  (returns  no  substitutions 
wnich  are  not  unifiers),  and  is  complete  (every  unifier  is  an 
instance  of  a returned  unifier) . 

££22l*  Tne  algorithm  is  guaranteed  to  terminate  since  it 
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performs  a finite  number  o^f  operations  on  the  finite  number  of 
non-negative  integral  solutions  generated  from  the  equation 
corresponding  to  the  two  terms.  The  generation  of  these 
solutions  is  finite  due  to  the  trial  solution  values  being 
bounded . 

The  algorithm  is  sound  since  each  solution  of  the  derived 
equation  causes  the  introduction  into  each  of  the  instantiated 
terms  of  an  equal  number  of  new  variable  occurrences.  Thus,  the 
two  instantiated  terms  have  the  same  number  of  occurrences  of 
each  new  variable  and  are  therefore  unified. 

Any  unifier  must  assign  to  each  variable  a term  of  the  form  t^ 
(whose  function  symbol  is  not  f)  or  a term  f ( t^*^!.  . . tj^'^m)  (with 

occurrences  of  term  t^  as  arguments  of  f)  . Let  k be  the 
cardinality  of  the  set  of  such  terms  t^  in  any  solution  to  the 
unification  of  a pair  of  terms  with  only  variables  as  arguments. 
The  two  instantiated  terms  must  have  an  equal  number  of 
occurrences  of  each  of  these  k terms  as  arguments  of  f.  That  is, 
aiC  . i + . . .+a„c  ■ » b,  dj , +.  . . +b„d _ |l<i<k)  where  m is  the  number 

of  distinct  variables  in  the  first  term  being  unified,  n is  tne 
number  of  distinct  variables  in  the  second  term,  aj  is  the 
multiplicity  of  the  j variable  in  the  first  term,  bj  is  the 
multiplicity  of  the  variable  in  the  second  term,  c^j  is  the 
number  of  occurrences  of  tern  i in  variable  j in  the  first  term, 
and  dj^j  is  the  number  of  occurrences  of  term  i in  variable  j in 
the  second  term. 

Each  tuple  ,d^2»  • • • »^in^  ^ solution  to  the 


equation  +.  . . = ^ly], ■*■••• corresponding  to  the  terms 

being  unified.  It  can  thus  (according  to  Lemma  3.1)  be  formeu  as 
the  sum  of  certain  non-negative  integral  solutions  to  the 
equation  weighted  by  positive  integers. 

Consider  the  unifier  corresponding  to  the  sum  of  all  tnose 
solutions  to  the  ecuation  which  are  required  in  the  formation  of 
any  of  the  tuples  • • • fC  ,d  will  snow  that 

the  nypotnesized  unifier  is  an  instance  of  this  unifier  returned 
by  the  algorithm. 

Include  in  the  value  of  the  new  variable  associated  with  each 
of  these  solutions  a number  of  occurrences  of  term  i equal  to  the 
coefficient  of  the  solution  in  the  weighted  sum.  This  will 
result  in  the  proper  assignment  of  Cj^j  occurrences  of  term  i to 
each  variable  j of  the  first  term  and  d^j  occurrences  of  term  i 
to  each  variable  j of  the  second  term. 

Do  this  for  each  of  the  k terms  in  the  solution.  Let  no  other 
or  additional  terms  be  included  in  the  values  of  the  new 
variables . 

This  assignment  of  terms  in  the  solution  to  new  variables 
associated  with  equation  solutions  generated  in  the  unification 
process  results  in  the  correct  number  Cj^j  or  d^j  of  each  term 
being  assigned  to  each  variable  of  the  original  two  terms. 

Thus,  any  solution  to  the  unification  of  two  terms  with  only 
variables  as  arguments  is  an  instance  of  a returned  unifier  and 
the  algorithm  is  complete,  j 

Theorem  3^11.  The  AC  unification  algorithm  for  general  terms 


273 


with  associative  and  commutative  function  always  terminates,  is 
sound,  and  is  complete. 

Proof.  Let  s and  t be  any  two  terms  being  unified.  Let  s* 
and  t^  be  the  terms  resulting  from  replacing  each  distinct  term 
by  a new  variable,  s*  and  t*  are  generalizations  of  s and  t 
respectively,  i.e.,  s*®=s  and  t*®=t  for  some  6 of  the  form 
{.  ...  , . . . } .where  each  is  a new  variable  and  each  c^  is 

the  term  in  s or  t it  replaces  in  s*  or  t* . 

Let  {Oj}  denote  the  unifiers  of  s*  and  t*  returned  by  the 
unification  algorithm  for  terms  with  associative  and  commutative 
function  with  only  variables  as  arguments.  Each  Oj  is  of  the 
form  { . . . ,Xi*d^ , . . . } where  each  Xj  is  a variable  of  s*  or  t*  and 
dj^  is  the  term  assigned  to  it  by  the  unification  algorithm. 
According  to  the  previous  theorem,  unification  terminates,  is 
sound,  and  is  complete  -for  this  case. 

Simultaneous  instances  of  e and  Oj  represent  unifiers  of  s and 
t since  s*a=s,  t*0=t,  and 

•Unifying  each  c^  with  each  d^  of  a returned  unifier  Oj  of  s* 
and  t*  results  in  (by  the  assumption  of  termination,  soundness, 
and  completeness  of  the  recursive  call  on  the  unification 
algorithm  for  terms  of  lesser  complexity)  a complete  set  of 
unifiers  for  the  original  terms  s and  t.  0 

We  have  presented  an  algorithm  for  unifying  general  terms  with 
associative  and  commutative  function.  Vv’e  have  proven  that  the 
algorithm  is  guaranteed  to  terminate,  is  sound,  and  is  complete. 

The  advantages  of  this  algorithm  as  compared  to  other 
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approaches  to  unifying  such  terms  are  that  the  associativity  and 
commutativity  properties  need  not  be  axiomatized  and  that  all  the 
unifiers  of  a pair  of  such  terms  are  immediately  returned 
eliminating  the  unnecessary  and  redundant  computation  often 
occurring  in  other  approaches  wnich  generate  only  some  of  the 
unifiers  at  each  step  with  no  indication  of  when  all  the  unifiers 
have  been  generated. 

3.16.  Class  unification 

.It  is  possible  to  write  a complete  class  unification  algorithm 
in  the  style  of  the  previously  presented  non- terminating  vector 
unification  algorithm.  The  algorithm  presented  below  is  sound 
and  complete  though  non- terminating  just  as  the  vector 
unification  algorithm  was.  However,  adding  extra  constraints  on 
substitutions  for  fragment  variables  (not  described  in  the 
algorithm  proper,  but  presented  -later)  eliminates  this 
difficulty,  resulting  in  an  algorithm  wnich  is  sound,  complete, 
and  guaranteed  to  terminate. 

Since  class  unification  requires  an  element  of  one  class 
possibly  to  be  matched  against  more  than  one  element  of  another 
class,  matched  elements  cannot  be  simply  removed  and  disregarded. 
Previously  matched  elements  must  be  retained.  They  may  be  kept 
in  an  additional  variable  in  the  algorithm  or,  for  simplicity  of 
description  and  implementation,  be  retained  in  the  original 
classes  themselves  as  in  the  algorithm  being  presented  here.  Tne 
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following  conventions  are  followed  by  the  algorithm:  previously 
matched  elements  of  two  classes  are  retained  in  the  right  end 
(tail)  of  the  two  classes  and  the  leftmost  of  any  two  duplicate 
elements  in  a single  class  is  immediately  eliminated.  Two 
classes  are  fully  unified  when  they  are  identical  (all  the 
elements  are  previously  matched  elements  in  the  common  tail  of 
the  two  classes)  . 

The  unifyclass  procedure,  when  presented  with  two  non-empty 
classes  with  no  previously  unmatched  common  elements  or  duplicate  i 

elements  within  a sing-le  class,  always  matches  the  first  element 
of  the  first  class  at  each  step  after  a possible  interchange  of 
I the  two  classes.  The  interchange  is  a normalization  operation 

whose  result  is  that  the  first  element  of  the  first  class  after 
the  interchange  is  guaranteed  to  be  previously  unmatched. 

, Class  unification  proceeds  almost  exactly  as  does  class 

semi-unification  described  earlier  except  only  a single  element 
of  the  opposing  class  is  included  in  each  substitution  for  a 
fragment  variable  as  in  vector  unification  described  earlier. 

X£iii5i25i_Zif 

1.  If  X is  of  the  form  (CLASS  a a y a c)  for  arbitrary  element 
a then  unify! (CLASS  « T a c);  y] . 

2.  ,If  y is  of  the  form  (CLASS  6 a « a c)  for  arbitrary  element 
a then  unify(x;  (CLASS  8 i a c)]. 

3.  If  X is  of  the  form  (CLASS  o a y)  and  y is  of  the  form  J 

(CLASS  8 a 5)  for  arbitrary  element  a and  Y^fi  then  | 

unify!  (CLASS  a Y a)  ; (CLASS  8 6 a)]. 
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4.  If  X is  of  the  form  (CLASS  «)  and  y is  of  the  form  (CLASS  g 
o)  then  unifyfy;  x] . 

5.  If  X is  of  the  form  (CLASS  *.*X  «)  then 

a.  for  each  fragment  variable  •►♦Y  in  y (y  possibly  reordered 

is  of  the  form  (CLASS  ♦♦Y  s))  let  8={X-(CLASS  ♦♦X', 

Y»(CLASS  4.*T  ♦♦Y')}  and  return  (eo  I o e unify[xe;  ye]} 
and 

b.  for  each  element  b in  y which  is  not  a fragment  variable 
and  does  not  contain  an  instance  of  variable  X (y  possibly 
reordered  is  of  the  form  (CLASS  b S)  ) let  6={X*(CLASS  b 

♦ ♦X')}  and  return  lea  I o e unifylxe;  ye)}  and 

c.  let  6={X*(CLASS)  } and  return  lea  la  c unifylxe;  ye]}. 

6.  .If  X is  of  the  form  (CLASS  a a)  where  element  a is  not  a 
fragment  variable  then 

a.  for  each  fragment  variable  ♦♦Y  in  y for  which  a does  not 
contain  an  instance  of  variable  Y (y  possibly  reordered  is 
of  the  form  (CLASS  ♦*Y  6))  let  e={Y*(CLASS  a ♦♦Y')}  and 
return  (eo  I a e unifylxe;  ye]}  and 

b.  for  each  element  b in  y which  is  not  a fragment  variable 
(y  possibly  reordered  is  of  the  form  (CLASS  b S)  ) return 
(8o  1 0 € unifyla;  b]  a o c unifylxe;  ye]}. 

Notes: 

1.  -If  a class  has  a repeated  element,  the  first  occurrences  of 
that  element  are  deleted  (steps  1,  2). 

2.  .If  two  elements  of  two  classes  not  in  the  common  tail  of 
the  classes  (not  previously  matched)  are  identical,  the  classes 
with  the  elements  moved  to  tne  end  are  unified  (step  3). 
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3.  To  insure  that  the  next  substitution  operation  use  a 
previously  unmatched  element,  the  classes  are  interchanged  if  the 
second  is  a tail  of  the  first  (step  4). 

4.  If  the  first  element  of  x is  a fragment  variable,  each 
substitution  composed  of  a non-fragment  variable  element  of  y not 
containing  the  variable  oeing  substituted  for  and  a new  fragment 
variable  is  performed  and  the  instantiated  classes  are  unified 
for  each  suostitution  (step  5b).  This  forms  a most  general 
substitution  for  the  fragment  variable  including  each  opposed 
non-fragment  variable  element.  For  each  fragment  variable 
element  of  y,  the  fragment  variable  of  y is  split  into  two 
fragment  variables  and  one  is  included  in  a substitution  for  the 
first  element  of  x as  above  and  the  instantiated  classes  are 
unified  for  each  substitution  (step  5a).  This  forms  a most 
general  substitution  for  the  two  fragment  variables  expressing 
their  common  (■►♦T)  and  disjoint  (♦♦X',  ♦*Y')  parts.  In  addition, 
the  empty  class  is  substituted  for  the  first  element  of  x (step 
5c)  . 

5.  If  the  first  element  of  x is  not  a fragment  variable,  it  is 
unified  with  each  non-fragment  variable  element  of  y and  the 
classes  instantiated  by  the  returned  unifiers  are  unified  (step 
6b).  For  each  fragment  variable  element  of  y not  contained  in 
the  first  element  of  x,  the  substitution  for  the  fragment 
variable  of  the  first  element  of  x and  a new  fragment  variaole  is 
performed  and  the  instantiated  classes  are  unified  for  each 
substitution  (step  6a).  This  forms  a most  general  suostitution 
for  the  fragment  variable  including  the  first  element  of  x. 
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Tne  alacrithm  can  readily  be  shown  to  be  sound  using  a proof 
similar  to  the  proof  for  (class)  semi-unification.  Steps  1,  2, 

3,  and  4 are  tne  same  as  for  semi-unification  and  nonce  their 
soundness  can  be  justified  in  the  same  way.  Steps  5 and  6 always 
perform  a substitution  on  the  arguments  and  recursively  call  the 
unification  algorithm  with  the  instantiated  arguments  and  return 
the  compositions  of  the  substitution  and  the  returned  unifiers. 
This  is  also  clearly  sound. 

Completeness  is  also  easy  to  prove.  Steps  1 and  2 merely 
remove  redundant  duplicate  elements  from  a single  class  forming 
an  equivalent  class.  Step  3 matches  identical  elements  from  the 
two  classes.  Every  previously  unmatched  fragment  variable  is 
permitted  to  contain  any  element  of  the  opposed  expression.  Any 
previously  unmatched  non-fragment  variable  is  permitted  to  be 
matched  to  any  element  of  the  opposed  expression  or  be  contained 
in  any  fragment  variable  of  the  opposed  expression.  Any  pair  of 
fragment  variables  can  be  matched,  selecting  out  their  common  and 
disjoint  subclasses.  The  procedure  is  clearly  exhaustive, 
exploring  all  alternatives  for  each  previously  unmatched  element, 
and  forming  most  general  substitutions  for  each  case. 

The  procedure  will  not,  in  general,  terminate  for  the  same 
reasons  the  vector  unification  algorithm  does  not  terminate.  The 
algorithm  when  matching  with  a fragment  variable  often  maintains 
the  same  number  of  variables  in  the  instantiated  expressions  to 
be  unified  or  even  increases  them.  Each  of  these  new  fragment 
variables  can  include  the  value  of  an  element  of  the  opposed 
ex pr ession. 


279 


For  example,  consider  the  unification  of  (CLASS  **X)  and 
(CLASS  A)  . 

unify((CLASS  ♦♦X) ; (CLASS  A)J 
1 unify! (CLASS  --Xl  A);  (CLASS  A)] 

I I unify  I (CLASS  A) ; (CLASS  A)] 

I I return  {X*(CLASSA),  Xl*(CLASS)} 

I I unify! (CLASS  *-X2  A);  (CLASS  A) 1 
I I I unify! (CLASS  A) ; (CLASS  A)] 

1 i i return  (X* (CLASS  A A),  Xl*  (CLASS  A),  X2*(CLAS3)1 

I I I unify! (CLASS  **X3  A);  (CLASS  A)] 


The  important  thing  to  recognize  in  the  example  is  that  all 
the  returned  unifiers  are  equivalent;  repetitions  in  a class  are 
totally  redundant. 

For  this  reason,  we  can  place  additional  restrictions  on  the 
matches  performed  with  fragment  variables  and  still  retain 
completeness.  This  set  of  restrictions  is  adequate  to  assure 
termination  in  all  cases. 

1.  Never  perform  step  5a  on  two  variables  whose  ancestors  have 
been  matched  against  each  other.  Assume  ♦*X  is  a descendant  of 
♦♦Xq  and  ♦♦Y  is  a descendant  of  i.e.,  ♦♦X  is  ♦•“Xjj,  generated 

after  a sequence  of  substitutions  1X^*(CLASS  bj^  *"*'^i+l)) 
performed  and  ♦*y  is  ♦♦Yp  generated  after  a sequence  of 

and  Xq  and  Yq  were  matched 


substitutions  {Yj*(CLASS  aj 


against  each  other  with  result  {Xy*(CLASS  ♦♦X'),  Y^j*(CLASS 

-♦T  Tnen  X^j,  and  Y^  should  not  be  unified  against  each 

other.  X is  either  descended  from  X'  or  from  T;  Y is  either 
descended  from  Y*  or  from  T.  If  X is  descended  from  X'  and  Y is 
descended  from  Y',  we  can  justify  not  matching  X and  Y by  tne 
assumption  that  T contained  all  the  common  elements  of  ancestors 
Xjj  and  Yq,  and  hence  X'  and  Y',  and  X and  Y have  no  common 
elements.  If  X is  descended  from  T and  Y is  descended  from  Y*, 
we  can  justify  not  matching  X and  Y since  T and  Y'  together 
compose  Yq  and  thus  can  be  assumed  to  have  no  common  elements, 
and  hence  X and  Y have  no  common  elements.  If  X is  descended 
from  X'  and  Y is  descended  from  T,  we  can  justify  not  matching  X 
and  Y since  T and  X'  together  compose  Xq  and  thus  can  be  assumed 
to  have  no  common  elements,  and  hence  X and  Y have  no  common 
elements.  If  both  X and  Y are  descended  from  T,  we  can  justify 
not  matching  X and  Y since  there  is  no  value  to  matching  elements 
of  the  same  class  against  each  other  directly. 

2.  Wever  perform  step  5a  or  step  6a  wnere  the  element  to  be 
included  in  the  value  of  the  fragment  variable  is  already  in  the 
value  of  any  ancestor  of  the  fragment  variable.  Performing  this 
step  under  these  circumstances  would  result  in  placing  a 
duplicate  element  in  the  value  of  the  ancestor  fragment  variable. 
This  restriction  can  be  applied  when  the  element  is  included  in 
the  value  of  any  ancestor  since,  in  the  case  where  the  fragment 
variable  has  multiple  ancestors,  some  containing  the  element  and 
some  not  containing  it,  an  alternative  sequence  of  substitutions 
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would  include  the  element  in  (a  descendant  of)  a fragment 
variable  representing  the  common  elements  of  the  ancestors. 

Since  there  is  a limit  on  the  number  of  matched  elements, 
being  the  number  of  non-fragment  variable  elements  in  the 
original  classes  (this  can  readily  be  seen  since  each  step  in  the 
algorithm  unifies  the  same  expressions  after  instantiation  or 
removal  of  duplicate  elements  or  matching  of  identical  elements)  , 
steps  5a  and  6a  can  be  performed  only  a finite  number  of  times, 
their  being  a limit  on  the  number  of  elements  that  can  be 
included  in  the  values  of  the  original  fragment  variables  which 
are  ancestors  to  all  derived  fragment  variables. 

The  introduction  of  additional  fragment  variables  in  step  5b 
creates  no  additional  difficulty  since  no  descendants  of  the 
introduced  fragment  variables  are  permitted  to  be  matched  against 
each  other. 

These  two  cases  account  for  all  the  non- terminating  behavior 
of  the  algorithm,  and  thus  the  algorithm  witn  these  constraints 
always  terminates,  is  sound,  and  is  complete. 


3.17.  Constraints 

Another  -facility  provided  by  0A4  pattern  matching  is 
constraints.  Two  types  of  constraints  can  be  applied. 

The  first  is  a type  constraint  applied  to  variaoles  which 
dictates  the  type  of  expression  (vector,  bag,  or  class)  wnich 
must  be  matched  to  the  variable.  For  example,  ♦X/8AG  represents 
the  requirement  that  variable  X must  be  bound  to  a oag. 
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The  type  constraint  may  be  readily  implemented  within  the 
confines  of  the  unification  algorithms  already  provided^by 
replacing  (on  input  or  when  the  expressions  are  unified) 
expressions  of  the  'form  ♦var/type  by  expressions  of  the  form 
(type  ♦♦var)  . For  example,  ♦X/BAG  would  be  replaced  by  (BAG 
♦ ♦X)  . 

The  second  is  a general  predicate  constraint  v/hich  can  be 
applied  to  any  expression  (although  its  effect  really  varies  only 
with  the  way  variables  of  the  expression  are  instantiated  by  the 
match).  For  example,  (''  (VECTOR  1 -X)  (LT  $X  2))  causes  a matcn 
to  fail  if  the  X of  (VECTOR  -X)  has  value  greater  than  or  equal 
to  2.  Thus,  the  expression  will  match  (VECTOR  1 0)  but  not 
(VECTOR  1 2)  . 

Given  (the  conventional  pattern  matching)  assumption  that  the 
opposing  (target)  expression  contains  no  variables,  general 
predicate  constraints  are  easily  effected  by  merely  applying  the 
predicate  to  the  opposing  expression  to  determine  the  truth  of 
the  predicate.  .If  the  predicate  is  true,  matching  continues.  If 
it  is  false,  the  current  partial  match  is  rejected. 

This  mechanism  does  not  work  in  the  case  of  the  opposing 
expression  containing  variables  or  other  complex  pattern  matching 
actors.  It  is  unreasonable  to  expect  (at  least  in  all  cases) 
that  the  predicate  has  sufficient  knowledge  to  either  propose  or 
determine  the  existence  of  instances  of  variables  or  pattern 
matching  actors  in  the  opposing  expression  for  wnich  the 
predicate  is  true. 
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Although  the  pattern  matching  facilities  of  0A4/0LI3P 
(disregarding  for  the  moment  the  added  data  types  of  BAGs  and 
CLASSes  and  considering  only  VECTORS)  appears  on  superficial 
examination  to  be  less  powerful  than  the  pattern  matching 
language  i'lATCHLESS  of  PLANNER,  the  ability  to  specify  general 
predicate  constraints  means  that  any  MATCHLESS  pattern  has  a 
corresponding  QA4/QLISP  pattern,  obtained  by  merely  incorporating 
all  the  matching  requirement  of  the  MATCHLESS  pattern  into  a 
general  predicate  constraint. 

There  are  three  principal  solutions  to  the  handling  of  these 
predicate  constraints.  The  first  solution  which  is  logically 
sound  and  complete  is  the  use  of  semi-unification  and  elaboration 
procedures  so  that  the  constraint  checking  is  applied  only  to 
ground  expressions,  or  expressions  at  some  level  of  complexity 
fully  understood  by  the  constraint  expression. 

Tne  more  pragmatically  reasonable  second  approach  is  to 
evaluate  the  constraint  expression  even  if  the  variables  it  is 
checking  are  not  fully  instantiated.  .It  is  then  the  constraint 
expressions  responsibility  to  decide  whether  the  match  should  be 
accepted  or  not,  the  decision  not  necessarily  being  correct  given 
the  lack  of  full  instantiation  of  the  variaoles. 

A third  approach  would  be  to  use  the  constrained  resolution 
approach  of  passing  down  the  constraint  and  determining  its 
correctness  only  when  X becomes  fully  instantiated  or  at  the  end 
of  the  deduction  in  cases  where  the  variables  are  not  fully 
instantiated.  Thus,  if  the  match  otherwise  succeeds,  and  X is 
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not  fully  instantiated,  the  constraint  (LT  §X  2)  would  he 
attached  to  X and  be  evaluated  as  soon  as  X is  fully 
instantiated,  or  at  the  end  of  the  deduction  if  X isn’t  fully 
instantiated  oefore  then. 


3.18.  Algebraic  'Unification 

So  far  in  discussing  unification,  we  have  not  concerned 
ourselves  at  all  with  the  semantics  of  the  symbols  appearing  in 
the  expressions  to  be  unified,  with  the  exceptions  of  data  types 
(VECTOR,  BAG,  CLASS)  and,  of  course,  variaoles.  This  omission  is 
not  because  such  concerns  are  unimportant,  but  because  (1)  the 
semantic  free  unification  we  have  been  discussing  is  important 
and  (2)  consideration  of  the  semantics  of  the  symbols  often 
readily  leads  to  areas  of  great  complexity  and  undecidability  of 
unif iability. 

Some  results  are  available  for  certain  interpretations  of 
symbols.  Certain  theories  (for  example,  group  theory)  can  be 
formulated  in  terms  of  complete  sets  of  reductions  (see  the  next 
section  for  a brief  discussion  of  them)  These  reductions  can  oe 
applied  to  a pair  of  expression,  the  expressions  being  equivalent 
if  and  only  if  they  reduce  to  the  same  expression.  .Included  in 
this  is,  of  course,  handling  of  the  group  identity  element,  which 
can  be  simplified  out  of  any  expression. 

Powerful  and  useful  though  complete  sets  of  reductions  are, 
they  are  not  universally  applicable,  most  importantly  not  being 
applicable  in  commutative  theories. 
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Another  aspect  of  algebraic  unification  is  demonstrated  by  the 
following  example.  .If  we  wish  to  unify  the  expressions  (+  ♦X  ♦y) 
and  (+  3 C)  where  + represents  the  addition  function  and  is 
DEFTYPEd  BAG,  bag  unification  will  return  the  unifiers  {X*-3,  Y*C} 
and  {X*C,  Y*3}.  This  is  the  best  tnan  any  unification  algorithm 
can  do  without  knowing  the  semantics  of  +.  If  we  do  use  wnat  we 
know  about  +,  then  we  can  return  a unifier  like  {X*(-  {+  3 C) 

-Y)  }. 

The  general  approach  is  then  to  form  an  equation  of  the  two 
expressions  oeing  unified,  e.g.,  (+  ♦X  ♦Y)={+  3 C)  and  then 
simplify  the  equation  to  provide  assignments  to  some  or  all  of 
the  variables  of  the  equation,  using  whatever  knowledge  one  has 
of  the  semantics  of  the  terras.  This  can  and  should  incorporate 
simplifications  such  as  replacing  operations  entirely  on  numoers 
by  their  results  wnen  such  applications  are  formed  in 
rearrangement  of  terms. 

Little  can  oe  said  concerning  general  techniques  involved  in 
this  algebraic  unification  since  it  is  so  thoroughly  conditioned 
on  the  semantics  of  the  terms  involved.  In  the  example  above, 
using  * (the  multiplication  function)  instead  of  + invalidates 
the  unifier  {X*{/  (*  3 C)  ♦Y)  because  of  the  singularity  of  the 
division  function  at  0.  Sandewall  [60]  proposes  a system  for 
conversion  of  logical  expressions  to  equivalent  procedural  forms 
which  includes  some  facilities  for  specification  of  the 
unification  mechanism  to  be  used  in  invoking  one  of  these 
proced  ures . 
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Some  form  of  algebraic  unification  is  probably  essential  in 
many  theorem  prover  based  systems  such  as  program  verifiers  in 
wnich  the  semantics  of  operations  like  + must  be  understood. 
However,  a general  need  for  algebraic  unification  does  not 
eliminate  the  utility  of  the  algorithms  presented  here  for 
unifying  such  items  as  vectors,  oags,  and  classes,  since  such 
more  primitive  procedures  must  oe  relied  upon  when  nothing  but 
the  data  type  is  known  about  the  semantics  of  the  operation. 


3.19.  Complete  sets  of  reductions 

This  section  is  intended  to  point  out  the  relevance  of  this 
current  investigation  into  unification  algorithms  to  a new  field 
of  research  in  mechanical  theorem  proving.  This  new  area  of 
complete  sets  of  reductions  looks  very  promising  though  it 
presently  has  some  significant  limitations  one  of  wnich  may  oe 
partially  addressed  by  some  of  the  ideas  presented  in  this 
chapter.  This  section  is  rather  indefinite  and  inconclusive,  but 
we  hope  it  demonstrates  a wider  applicability  of  the  ideas  of 
this  chapter  than  is  immediately  obvious. 

Complete  sets  of  reductions  are  sets  of  simplifying  rewriting 
rules  the  successive  applications  of  wnich  to  any  teim  by  some 
algorithm  is  guaranteed  to  terminate  (the  terms  "simplifying"  and 
"reduction"  refer  to  this  finite  termination  property)  and  is 
guaranteed  to  terminate  uniquely  regardless  of  the  order  of 
applications  of  reductions  (the  term  "complete"  refers  to  tnis 
unioue  termination  property). 
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Complete  sets  of  reductions  have  been  used  or  suggested  in 
previous  systems  such  as  Nevin's  theorem  prover  (341,  Guard  et 
al ' s theorem  prover  (571,  and  simplifiers  by  Slagle  I40j. 

They  were  first  thoroughly  formalized  by  Knutn  and  Bendix  (24] 
who,  most  importantly,  provide  an  algor  itnra  ( tne  Knutn-Bendix 
extension  algorith)  wnich  often  succeeds  in  extending  an 
incomplete  set  of  reductions  to  a complete  set.  They  have  also 
been  extensively  examined  by  Lankford  (26,27]. 

Unification  is  relevant  to  complete  sets  of  reductions  since 
unification  is  employed  in  the  test  for  completeness  (unique 
termination)  of  a set  of  reductions.  A subterm  of  the  left 
(unreduced)  side  of  a reduction  can  be  unified  with  the  left  side 
of  another  to  represent  the  case  of  both  reductions  being 
simultaneously  applicable.  If  in  every  case  wnere  such  a 
unification  is  possible  the  result  of  applying  the  first 
production  and  the  result  of  applying  the  second  reduction 
simplifies  to  the  same  term,  the  set  of  reductions  is  complete. 

This  test  for  completeness  plays  a vital  role  in  the 
Knuth-Bendix  extension  algo’'ithm  since  if  in  trying  to 
demonstrate  completeness  of  the  current  set  of  reductions  two 
different  non-reducible  terms  are  generated  from  the  same  term 
the  two  terms  are  composed  into  a new  reduction  which  is  proposed 
as  an  addition  to  the  current  set  of  reductions. 

An  example  of  the  performance  of  Knuth-Bendix  extension 
algorithm  (transcribed  from  a version  we  wrote  in  LISP  using 
unification  and  substitution  functions  of  P3TP)  is  the  following 
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production  of  a complete  set  of  reductions  for  free  groups 
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algoritnm  is  presented  with  reductions  1,  2,  and  3.  These  are 

the  left  identity  axiom  and  the  left  inverse  axiom  and  the 

associative  axiom  expressed  as  rewrites.  In  tne  case  of 

associativity,  “simplicity"  of  terms  is  defined  such  that 

(faf(bc))  is  simpler  than  f(f(ab)c). 

(1  (F  (E)  X)  X) 

(2  (F  (G  X)  X)  X) 

(3  (F  (F  X Y)  Z)  (F  X (F  Y Z)  ) ) 

(4  (F  (G  X2)  (F  X2  XI))  XI)  FROM  3 AND  2 
(5  (F  (G  (E))  XI)  XI)  FROM  4 AND  1 

(6  (F  (G  (G  (E)))  XI)  XI)  FROM  5 AND  4 

(7  (G  (E))  (E))  FROM  6 AND  2 

6 DELETED 
5 DELETED 

(8  (F  (G  (G  XI))  (E))  XI)  FROM  4 AND  2 

(8  (F  (G  (G  XI))  X2)  (F  XI  X2))  FROM  8 AND  3 

8 DELETED 

(lU  (F  XI  (E))  XI) 

(11  (G  (G  XI))  XI)  FROM  10  AND  9 

9 DELETED 

(12  (F  XI  (G  XI))  (E))  FROM  11  AND  2 

(13  (F  XI  (F  X2  (G  (F  XI  X2))))  (E))  FROM  12  AND  3 

(14  (F  X2  (F  (G  X2)  XI))  XI)  FROM  12  AND  3 

(15  (F  X2  (F  X3  (F  (G  (F  X2  X3))  XI)))  Xl ) FROM  14  AND  3 

(16  (F  (G  (F  XI  X2))  (F  Xl  (F  X2  X3)))  X3)  FROM  4 AND  3 

(17  (F  Xl  (F  X2  (F  X3  (G  (F  Xl  (F  X2  X3 ) ) ) ) ) ) (E))  FROM  13  AND  3 

(18  (G  (F  (G  (F  Xl  X2))  Xl))  X2)  FROM  16  AND  2 

(19  (F  (G  (F  Xl  (G  X2)))  Xl)  X2)  FROM  16  AND  2 

(20  (F  (G  (F  Xl  X2))  Xl)  (G  X2))  FROM  IS  AND  11 

is  DELETED 

18  DELETED 

(21  (F  (G  (F  X3  Xl))  (F  X3  X2)  ) (F  (G  Xl ) X2))  FROM  20  AND  3 

16  DELETED 

(22  (G  (F  (G  Xl)  X2))  (F  (G  X2)  Xl))  FROM  21  AND  2 

(23  (G  (F  Xl  X2))  (F  (G  X2)  (G  Xl ) ) ) FROM  22  AND  11 

12  DELETED 
21  DELETED 
20  DELETED 

17  DELETED 
15  DELETED 

13  DELETED 

Note  that  the  preceding  successful  completion  of  the  original  set 
of  reductions  used  a complexity  ordering  in  attempting  to 
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complete  the  set.  A depth  first  ordering  goes  down  a fruitless 

infinite  branch,  and  a breadth  first  ordering  is  likely  terminate 

tne  algorithm  with  failure  by  producing  something  like  (F  (G  (F 

XI  X2))  Xi)  and  (T  (G  (F  X3  X2))  X3)  as  the  final  results  of 

simplifying  a single  term,  neither  of  which  is  reducible  to  the 

other,  causing  termination  of  the  algorithm  with  failure. 

The  final  complete  set  of  reductions  (written  in  more 

conventional  form)  iSL: 

1.  e*x*x 

2 X** 

3.  (x*y)  *z*x*  (y*z) 

4.  x"|*(x*y)-»y 
7.  e“^*e 

10.  x*e-*x 

11.  (x-l^-l*x 

12.  x*x~^*e 
14.  x*(x"l*Y-y 
23.  ( x*y) ♦y"^ 

We  have  been  extensively  concerned  with  special  representation 
and  unification  of  associativity  and  commutativity.  Ttris  is  __ 
highly  relevant  to  this  new  field  of  complete  sets  of  reductions. 

We  have  adopted  a representation  for  associative  functions 
wnich  permits  an  arbitrary  number  of  arguments  rather  than 
requiring  par enthesi zation . Our  unparenthesized  form  is  a 
canonical  form  for  associativity;  likewise,  the  "all  parentheses 
to  the  right"  form  used  by  present  complete  sets  of  reductions 
technology  is  a canonical  form  for  associativity. 

We  feel  that  the  use  of  the  “all  parentheses  to  the  right" 
form  has  certain  dire  consequences  for  the  size  and  complexity  of 
the  complete  set  of  reductions  when  compared  with  the 
unparenthesi  zed  form. 
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If  the  unparontheci zed  form  is  used,  reduction  3 can  of  course 
be  eliminated,  and  reductions  4 and  14  can  be  eliminated  since 
their  presence  is  recuired  only  by  the  compulsion  to  retain  a 
parenthesized  structure.  The  resulting  set  of  7 reductions  is 
not  only  more  compact,  but  it  includes  no  reductions  which  are 
not  standard  axioms  of  group  theory. 

One  of  the  most  important  deficiencies  at  the  present  time  of 
complete  sets  of  reductions  is  their  inability  to  handle  theories 
including  commutativity.  The  reason  for  this  is  that  complete 
sets  of  reductions  are  simplification  based,  and  there  is  no  way 
the  rewriting  rule  x*y*y*x  can  be  interpreted  as  a simplification 
for  all  instances  of  x and  y. 

The  solution  to  this  problem  seems  to  lie  in  the  area  of, 
rather  than  having  a single  uniform  complete  set  of  reductions, 
having  a hierarchical  system,  composed  of  symmetric  rewriting 
rules  (such  as  for  commutativity)  and  reductions.  A term  is 
irreducible  if  no  reduction  is  applicable  to  any  term  derivable 
from  the  term  by  use  of  the  symmetric  rewriting  rules.  The 
unique  termination  criterion  would  be  replaced  by  a check  that 
the  two  irreducible  terms  be  rewritable  one  to  the  other  by  the 
symmetric  rewriting  rules,  i.e.,  that  they  be  in  the  same 
equivalence  class  determined  oy  the  symmetric  rewriting  rules. 

.It  seems  essential  that  the  output  of  application  of  the 
symmetric  rewriting  rules  be  equal  in  simplicity  to  the  input. 
Otherwise  there  is  a potential  difficulty  concerning  termination 
of  the  simplification  process. 
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In  this  area  too,  the  use  of  associativity  in  a reduction 
causes  complications.  .If  x.*y*y*x  is  taken  to  be  a symmetric 
rewriting  rule,  then  (a*a)*a  and  a*(a*a)  are  in  the  same 
equivalence  class,  but  a*(a*a)  is  simpler  than  (a*a)*a  in  present 
complete  sets  of  reductions. 

Tnus,  extension  of  complete  sets  of  reductions  to  handle 
commutativity  may  profit  from  using  the  unparenthesi zed 
representation  for  associative  functions. 

•In  addition,  special  unification  algorithms  (like  the  AC 
unification  algorithm)  may  be  usefully  employed  in  future 
extensions  to  the  Knuth-Bendix  extension  algorithm  to  handle 
commutativity  or  other  properties.  Such  algorithms  would  be  used 
rather  than  the  ordinary  unification  algorithm  in  the  generation 
of  terms  to  which  to  reductions  are  both  applicable. 


3.20.  .Implementation 

The  semi-unification  algorithms  for  vectors,  bags,  and 
classes,  and  the  AC  unification  algorithm  for 

associative-commutative  functions  of  the  first  order  predicate 
calculus  have  all  been  implemented  substantially  as  described 
previously. 

These  semi-unification  and  unification  algorithms  replaced  the 
standard  predicate  calculus  unification  algorithm  of  PSTP  as 
described  in  the  previous  chapter.  Thus,  the  new  PSTP  is  capable 
of  dealing  with  the  data  types  vectors,  bags,  and  classes,  and 
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the  corresponding  concepts  ol  associativity,  commutativity,  and 
idempotence  at  a high  level. 

The  semi-unification  procedures  arc  modifiaole  (by  the 
replacement  of  three  functions)  to  use  tne  CV\4/0LlSf  variable 
designation  conventions  (i.e.,  variables  to  be  assigned  values 
must  be  preceded  by  one  of  the  ♦,  , or  prefixes)  or  the 

PSTP  convention  for  the  predicate  calculus  (i.e.,  variables  are 
represented  by  (LISP)  atoms  while  constants  (as  any  function 
headed  term)  are  represented  by  lists). 

In  the  case  where  0A4/QLISP  conventions  are  used,  the  type  of 
unification  to  be  performed  is  dictated  by  the  type  of  the  head 
of  the  expression.  The  type  of  expression  neads  (i.e.,  function 
symbols)  can  be  set  by  the  DEFTYPE  function.  For  example, 
(DEFTYPE  (QUOTE  f)  (QUOTE  BAG))  declares  F to  have  type  oag  so 
that  the  unification  of  (F  ♦X  ♦Y)  and  (F  A B)  will  use  bag 
semi- unification . 

The  default  type  is  type  vector,  except  BAG  and  CLASS  are 
initially  typed  bag  and  class  respectively. 

In  the  case  of  predicate  calculus  representation,  DEFTYPEing 
the  function  symbols  is  not  sufficient.  Since  variables  of  the 
predicate  calculus  have  no  prefixes  to  distinguish  what  kind  of 
matching  can  be  performed  with  them  (i.e.,  whether  they  are  to  be 
treated  as  simple  or  fragment  variables),  it  is  impossible  to 
distinguisn  textually  the  cases  in  which  the  expression  f(xy) 
Should  oe  treated  as  if  f were  associative  or  not.  (In  the 
QA4/QLISP  representation,  associativity  would  be  represented  by 
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use  of  fcaoment  variables  rather  than  simple  variables; 
cases  the  DEt'TYPE  of  the  function  is  the  same.) 

The  solution  in  this  implementation  is  the  creation  of  three 
declarations:  ASSOCIATIVE,  COlii-IUTATIVE , and  lUEHPOTEWT. 

The  (ASSOCIATIVE  fj  ...  f^^)  function  declares  each  of  its 
argument  function  symbols  fj^  to  be  associative.  Tne  effect  of 
this  is  to  cause  each  variable  occurrence  as  an  argument  of  one 
of  tnese  functions  to  be  treated  as  if  the  variable  were  preceded 
by  the  (match  at  least  one  element)  prefix.  A function 

symbol  not  declared  to  be  associative  will  have  variaole 
occurrences  as  arguments  treated  as  if  the  variable  were  preceded 
by  the  ♦ (match  exactly  one  element)  prefix. 

The  (COMMUTATIVE  fj^  ...  f^,)  function  declares  each  of  its 
argument  function  symbols  f^  to  be  commutative.  The  effect  of 
this  is  essentially  equivalent  to  declaring  tne  type  of  tne 
function  symbol  to  oe  bag. 

The  (IDEMPOTENT  f^  ...  fj^)  function  declares  each  of  its 
argument  function  symbols  f^^  to  be  idempotent.  The  effect  of 
this  is  essentially  equivalent  to  declaring  the  type  of  the 
function  symbol  to  be  class. 

If  a function  symbol  is  not  declared  in  any  of  the  above  ways, 
it  will  be  treated  in  the  standard  first  order  predicate  calculus 
manner.  If  it  is  declared  to  be  associative,  all  the  variables 
will  be  implicitly  prefixed  variables  and  unification  will  be 
associative.  If  it  is  declared  to  be  commutative,  but  not 
associative,  commutative  unification  will  be  performed,  but  a 
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vari2ble  can  match  only  a single  element  of  the  opposing 
expression  just  as  in  the  predicate  calculus  case.  •!£  it  is 
declared  to  associative  and  commutative,  noth  the  previous 
effects  will  be  present,  representing  associativity  plus 
commutativity,  riere  it  is  optional  whether  bag  semi-unification 
or  AC  unification  is  performed.  If  it  is ’declared  to  be 
idempotent,  class  semi-unification  will  be  performed  (thus  the 
idempotent  declaration  effectively  does  a commutative  declaration 
as  well  since  class  unification  is  always  commutative;  there  is 
no  representation  in  the  data  types  vector,  bag,  and  class  for 
idempotence  without  commutativity)  . 

The  ASSOC.IATIV£,  COidWUTA’JIIVE , and  .IDEMPOTENT  declarations  can 
be  undone  by  (NOT-ASSOC,IAT.IVE  f;^  ...  f^^)  , (NOT-COMMUTATIVE  ... 
fp,)  , and  (NOT-I  DEM  POTENT  f ...  fj^)  respectively. 

3.21.  Experimentation 

•Some  preliminary  experiments  were  performed  to  test  the 
effectiveness  of  built-in  associativity  in  proolems  such  as  the 
GR0UP2  problem  cited  in  the  previous  chapter  and  taken  from  the 
CA3.6  study. 

The  original  axiomatization  for  the  GKOUP2  problem  includes 
two  4 literal  axioms  which  together  represent  associativity;  each 
refers  to  one  of  the  rearrangements  of  parentheses  in  the 
equality  f ( xf { yz) ) »f ( f ( xy) z . 

This  representation  is  clearly  incompatible  with  our 
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1. 

e*x=z  •* 

x=z 

2. 

x*e*y=z 

♦ x*y*z 

3. 

x*e=z  •* 

x*z 

4. 

x=z  ♦ e* 

X»2 

5. 

x*y=2  ♦ 

x*e*y=z 

6. 

X=Z  ♦ X* 

e=2 

7. 

U*U*X»2 

♦ e*x»z 

B. 

x*u*u*y= 

z ♦ x*e 

9. 

X*u*u*2 

♦ x*e=2 

10. 

e*x»z  ♦ 

u*u*  x= 

11. 

x*e*y=z 

♦ x*u* 

12. 

x*e=2  ♦ 

x*u*  u= 

13. 

a*  b=c 

14. 

b*a»c 

representation  for  associativity  which  permits  the  declaration  of 

particular  function  symbols  to  be  associative.  Accordingly,  the 

GK0UP2  problem  can  be  r eax iomati zed  in  the  following  way: 

add  identity  at  left  end  of  term 
add  identity  in  middle  of  term 
add  identity  at  right  end  of  term 

remove  identity  at  left  end  of  term 
remove  identity  in  middle  of  term 
remove  identity  at  right  end  of  term 

replace  e by  u*u  at  left  end  of  term 
'Z  replace  e by  u*u  in  middle  of  term 

replace  e by  u*u  at  right  end  of  term 

replace  u*u  by  e at  left  end  of  term 
^*2  replace  u*u  by  e in  middle  of  term 
replace  u*u  by  e at  left  end  of  term 

special  hypothesis 

theorem 

Note  that  although  the  equality  symbol  * is  used  in  the  above 
axiomatization , there  is  no  need  in  this  case  for  the  special 
equality  inference  rules,  and  = was  assigned  no  special 
interpretation  in  the  experiment. 

By  simple  inspection,  it  is  apparent  (since  no  antecedent 
literals  are  negated)  that  no  negative  literals  can  ever  appear 
in  a derived  chain,  and  thus  no  reduction  (requiring 
complementarity  of  literals,  and  thus  requiring  a negated 
literal)  can  occur.  Hence,  retention  of  A-literals  is  not 
necessary  for  completeness  in  this  case.  Therefore,  retention  of 
A-literals  was  suppressed  to  facilitate  subsumption  which  would 
now  have  to  check  subsumption  only  of  the  single  S-literal  in 
each  derived  chain.  Only  a very  weak  form  of  subsumption  was 
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employed,  which  checked  for  identity  of  the  derived  chains  (after 
variaole  standardization  so  tnat  variants  (v/itn  literals  in  the 
same  order)  will  be  subsumed) . 

A very  simple  ordering  function  was  employed  which  expanded 
chains  in  order  of  increasing  size  of  their  stored  list 
representation.  Extension  oy  all  axioms  was  employed  at  each 
step  rather  than  just  extension  by  a single  axiom,  i.e.,  SEARCH 
rather  than  SEARCH2  was  used,  a notably  inefficient  strategy. 
Nevertheless,  the  results  were  reasonably  successful,  with  11 
chains  being  expanded  (7  of  which  were  on  the  solution  path) 
resulting  in  the  derivation  of  147  chains  86  of  which  were 
eliminated  by  subsumption. 

The  resulting  proof  was: 


14.  b*a*c 

theorem 

15.  D*a*e=c 

extend 

by 

3 

16.  b*a*xl*xl»c 

ex  tend 

by 

9 

17.  e*b*a*xl*xl=c 

extend 

by 

1 

18.  x2*x2*b*a*xl*xl»c 

ex  tend 

by 

7 

19.  e*a*b“c 

ex  tend 

oy 

10 

20.  a*b*c 

ex  tend 

by 

4 

21.  0 

ex  tend 

by 

13 

The  most  important  step  in  the  proof  is  the  extension  of  chain  Id 
by  axiom  lU.  Here,  associative  unification  discovered  the 
unifying  substitution  {x2«-a,  xl*a*b,  u«-a*a*b,  x»a*b)  . 

This  result  can  be  substantially  improved  upon  by  (1) 
employing  a more  effective  search  strategy,  including  use  of  the 
3EARCH2  ratner  than  SEARCH  command  or  (2)  eliminating  use  of  the 
identity  element  e,  and  axiomatizing  the  problem  so  that  u*u  can 
be  directly  inserted  or  removed  anywnere  in  the  term. 

Tne  deficiency  of  testing  problems  like  GR0UR2  witn  built-in 
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associativity  is  the  enormous  change  in  rt-j^resentation.  (An  even 
larger  and  to  a certain  extent  more  appropriate  change  in 
representation  would  be  use  of  eauality  inference  rules 
permitting,  for  example,  the  replacement  of  axioms  1 tnrougn  6 by 
e^x^x  and  x*e=»x.)  This  change  of  representation  not  only  makes 
performance  comparisons  difficult,  it  will  often  Dias  the  results 
against  built-in  associativity  since  the  new  representation 
produces  only  single  literal  derived  chains  which  cannot  oe 
distinguished  by  length  whereas  the  old  representation  produced 
length  varying  chains,  use  of  the  associative  axioms  adding  3 
literals  to  a derived  chain,  and  lengtn  preference  strategies  are 
among  the  most  effective  theorem  proving  search  strategies. 

.In  order  to  make  a more  meaningful  assessment  of  the 
effectiveness  of  building  in  associativity  and  commutativity,  we 
looked  for  a set  of  problems  in  which  built-in  associativity  and 
commutativity  could  be  turned  on  or  off  without  altering  tne 
problem  representation  (except,  of  course,  for  the  deletion  of 
the  associative  or  commutative  axioms  when  associativity  or 
commutativity  were  turned  on) . 

A set  of  problems  for  manipulation  of  the  arithmetic  addition 
and  subtraction  functions  for  which  this  could  be  done  was  found 
in  an  experiment  for  the  FORTRAN  Deductive  System  (FDS)  (591. 

A predicate  calculus  axiomatization  for  these  problems, 
originally  expressed  in  terms  of  rewriting  rules,  was  provided  by 
Chang  [9]. 

12  of  the  Iri  FDS  problems  were  run  with  two  search  strategies 
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and  with  (1)  neither  associativity  nor  commutativity  built-in, 

(2)  only  commutativity  built-in,  (3)  only  associativity  built-in, 
(4)  both  associativity  and  commutativity  ouilt-in.  This  is  a 
total  of  12*2*4=9fa  attempted  proofs  d3  of  which  were  successful. 
With  the  exception  of  example  6,  any  attempted  proof  recuiring 
the  derivation  of  more  than  500  chains  was  aborted,  wot  all 
strategies  were  tested  on  the  remaining  6 of  lo  fuS  problems,  but 
those  that  were  resulted  in  no  proof  within  the  500  chain  limit. 
Thus,  to  conserve  effort,  only  these  12  were  fully  examined. 

Note  that  these  results  should  only  be  used  for  examining  tne 
effectiveness  of  the  building-in  of  associativity  and 
commutativity.  They  are  not  comparable  to  the  results  of  fOS 
since  (1)  the  predicate  calculus  representation  and  rules  of 
inferences  differ  from  those  of  FDS  wnich  more  closely  resembles 
a paramodulation  based  system  since  rewriting  rules  can  be 
applied  to  any  suoterm,  (2)  naive  search  strategies  are  being 
employed  here  by  PSTP  as  opposed  to  the  means-ends  analysis  of 
FDS,  (3)  FDS  is  a parameterized  incomplete  system  whereas  the 

i 

I 

search  strategies  employed  oy  PSTP  are  complete  (though  it  is  not 
noted  in  (591  what  effect  the  parameters  had  on  the  solution 
costs  for  FDS,  and  (4)  all  PSTP  proofs  used  the  same  set  of  11 
axioms  whereas  FDS  added  as  an  axiom  each  previously  proved 
theorem,  (5)  FDS  employed  learning  to  fill  in  values  of  a 
signature  table  to  optimize  its  performance.  (Since  PSTP  is 
itself  a general  symbol  manipulation  system,  it  could  be  used  or 
adapted  to  possess  these  features  of  FE)S.  However,  such  use  or 
adaptation  was  outside  the  scope  of  this  experiment.) 
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Acceptable  Chains  Generated 


Problem 

501/100  stragegy 
with  built-in 
None  Commu.  Assoc 

nblits  + size  blits  strategy 
with  built-in 

. A.+C.  None  Commu.  Assoc.  A.+C. 

1. 

63 

1 

1 

1 

13 

1 

1 

1 

2. 

3 

3 

3 

3 

3 

3 

3 

3 

3. 

4 

5 

4 

5 

3 

3 

3 

3 

4. 

230 

1 

32 

1 

155 

1 

23 

1 

5. 

49 

5 

63 

5 

12 

3 

12 

3 

6. 

>500 

277 

>500 

2 60 

>500 

>500 

>500 

>500 

7. 

>500 

21 

17 

17 

95 

45 

74 

b 

6. 

2615 

5 

>500 

5 

2 85 

3 

21 

3 

9. 

4ti 

1 

59 

1 

12 

1 

12 

1 

10. 

30 

34 

30 

34 

13 

13 

13 

13 

11. 

>500 

256 

130 

31 

326 

18 

68 

13 

12. 

>500 

316 

>500 

292 

>500 

>500 

4 90 

44 

Table  3. 

. 1 

The 

resul ts 

of  this 

ex  per 

iment  are  presented  in 

Appendix  B.  A 

summary 

of  the 

number  < 

o-f  chains  generated 

for  each 

proof 

attempt 

is  shown  in  Table  3.1. 

.In  the  timing  statistics. 

a time 

suffixed 

by  represents  a time 

obtained  using  the 

version 

of  PSTP 

described  in  the  previous  chapter  wnich  had  a unification 

algorithm  codec 

1 in  assembly 

language  and  having  no 

capability  for 

built-in  associativity 

or  commutativity. 

All  other  times 

were 

obtained  using 

a later 

version  with 

a unification 

algoritnm  coded 

in  LISP 

and  allowing  for  associativity  and 

commutativ ity . 

Comparison  of  the  timing  results  snow  the 

latter  version 

to  be 

3U0 


approximately  2.5  times  slower  even  when  built-in  associativity 
and  commutativity  are  not  oeing  used. 

Tne  two  search  strategies  used  were  the  unbounded  501/luO 
strategy  discussed  in  the  previous  chapter  and  a simple  unbounded 
strategy  ordering  cnains  on  the  basis  of  expected  size  of  stored 
list  representation  of  B-literals.  Tne  latter  strategy  was 
generally  though  not  uniformly  superior  for  these  examples.  Tnis 
is  attributable  to  the  importance  of  complexity  of  the  generated 
terms  (e.g.,  function  depth)  which  is  not  considered  by  tne 
501/100  strategy  which  considers  only  length  and  deduction  level. 

The  proofs  discovered  by  the  various  combinations  of  search 
strategy  and  use  of  built-in  associativity  and  commutativity  are 
also  presented  in  Appendix  B. 

Proofs  requiring  built-in  associativity  or  commutativity  are 
annotated  oy  * or  **  at  the  steps  at  which  the  proof  would  be 
clocked  if  ouilt-in  associativity  or  commutativity  were  turned 
off.  Of  course  in  the  case  of  associativity,  associativity  is 
used  in  places  other  than  just  the  noted  lines  since  though  the 
proof  may  not  be  blocked  at  a certain  step,  tne  step  may  require 
substitution  for  a variable  using  the  associative  convention  of 
eliminating  the  extra  variables,  i.e.,  using  '£(abc)  for 
f{f(ab)c).  For  example,  chain  17  of  proof  C of  example  4 has  the 
term  a+(b-c)+xl  in  the  last  literal  requiring  built-in 
associativity  be  turned  on  lest  the  term  be  represented  as 
(a+(b-c))+xl  or  a+ ( (b-c) +xl ) , but  the  proof  would  not  be  blocked 
by  lack  of  built-in  associativity  until  the  following  step. 
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Only  rarely  (example  3,  Sul/lUU  strategy,  commutativity; 
example  5,  5ul/lou  strategy,  associativity;  example  y,  5ul/luo 
strategy,  associativity;  example  lu,  501/1U0  strategy, 
commutativity)  did  the  use  of  built-in  associativity  or 
commutativity  degrade  the  performance  of  PSTP  on  the  proolems, 
and  never  in  a case  where  the  commutative  axiom  was  used  in  the 
nothing  built-in  case  did  building-in  commuativity  degrade 
performance,  and  likewise  for  associativity. 

In  examples  2,  3,  and  10,  no  improvement  was  found  using 
built-in  associativity  or  commutativity,  and  the  same  proof  was 
found  for  all  combinations  of  search  strategy  and  building-in 
associativity  or  commutativity.  Neither  the  associative  axiom 
nor  the  commutative  axiom  was  used  in  any  of  these  examples. 

Far  more  typical  are  the  results  for  the  other  problems  in 
which  substantial  improvements  were  noted  for  either  building-in 
either  associativity  or  commutativity.  For  some  examples  and 
search  strategies,  the  only  proofs  discovered  within  the  5vJ0 
generated  chains  limit  were  discovered  using  built-in 
associativity  or  commutativity  (examples  6,  7,  11,  and  12). 

There  was  an  up  to  500  to  1 improvement  in  the  number  of  chains 
generated  (example  8)  for  using  built-in  associativity  or 
commutativity. 

Several  proofs  were  substantially  shortened  by  building-in 
associativity  or  commutativity,  such  as  proofs  of  example  1,  4, 
and  9 being  reduced  to  trivial  one  step  proofs.  .In  example  1, 
proof  B represents  the  use  of  commutativity  of  + to  prove  one 
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associative  rewrite  derivable  from  the  otner!  In  some  cases,  use 
of  built-in  associativity  improved  performance  even  when  the 
original  proof  contained  no  use  of  the  associative  axiom,  as  in 
ex  am  pi  e 8 . 

.In  summary,  the  use  of  built-in  associatiy ity  or  commutativity 
was  highly  effective  for  this  set  of  problems,  and  the  experiment 
strongly  supports  the  claim  of  usefulness  of  building-in  these 
concepts  in  this  manner. 

3.22.  Background  and  summary 

Material  presented  here  concerning  data  types  and 
representation  of  expressions  is  based  on  QLISP  [37].  CLISP  is 
an  extension  of  LISP  possessing  most  of  the  capability  of  CA4 
(43],  a language  interpreted  in  LISP. 

Specifically,  the  choice  of  fundamental  data  types  of  vectors, 
bags,  and  classes  was  made  since  these  are  the  data  types  of 
OLISP.  The  syntax  and  most  of  the  semantics  for  simple  variables 
and  for  fragment  variables  with  ♦♦  and  S5  prefixes  follow  CLISP 
rules.  The  association  of  a type  with  the  head  of  each 
non-atomic  expression  is  motivated  by  the  CLISP  DEFTYPEing 
capability;  however,  here  the  type  is  permanently  associated  witn 
the  head  and  affects  future  pattern  matching  or  unification 
whereas  DEFTYPEing  in  CLISP  defined  input/output  conventions  for 
expressions  with  some  head. 

Tne  principal  extension  to  data  types  and  expression 
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representation  presented  here  is  the  introduction  of  fragmei.t 
variables  witn  and  prefixes.  Although  the  use  of  CI-ISP 

fragment  variables  permits  the  expression  of  variable  lengtfi 
expressions  and  associativity,  it  is  the  use  of  the  fragment 
variables  with  and  $5$  prefixes  which  permits  their 

expression  in  a fashion  identical  to  that  of  the  formal  logic. 
They  also  serve  a useful  role  in  eliminating  some  redundancy  in 
unifying  expressions  after  variable  splitting. 

Treatment  of  fragment  variables  is  extended  somewhat  from  that 
in  CLISP.  In  OLISP,  bags  and  classes  may  have  only  one  fragment 
variable.  Also,  fragment  variables  in  classes  match  only 
elements  not  matched  elsewnere.  Tnis  latter  restriction  is 
clearly  incomplete  and  is  thus  eliminated  here. 

.Incomplete  unification  algorithms  similar  to  semi-unification 
have  been  developed  for  the  first  order  predicate  calculus  case 
as  for  example  by  Nevins  [34].  The  separation  of  unification 
into  two  processes,  sem.i-unif ication  and  variable  splitting,  to 
provide  completeness  was  independently  discovered  by  Slagle  146] 
in  the  first  order  predicate  calculus  case.  In  the  case  of 
argument  lists  of  non-commutative  associative  functions,  ne 
defines  a "weak  associative  unification  algorithm"  corresponding 
to  our  vector  unification  in  the  slightly  more  general  vector 
case.  An  algorithm  for  commutativity  is  not  explicitly 
specified,  and  the  suggestion  for  combining  associativity  and 
commutativity  is  essentially  to  apply  the  weak  associative 
unification  algorithm  to  permuted  argument  lists  of  commutative 
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associative  functions.  This  must  oe  done  with  more  care  than  is 


suggested  by  Slagle  since  an  obvious  commutative  unification 
algorithm  (permute  the  elements  of  just  one  of  the  expressions) 
followed  by  weak  associative  unification  is  not  adequate  since 
variables  of  the  other  expression  would  still  be  constrained  to 
match  contiguous  sequences  of  elements  of  the  opposing  unpermuteo 
expression. 

•Two  forms  of  variable  splitting  were  presented  here.  The 
first  allowed  variable  splitting  to  be  performed  on  only  one  of 
the  expressions  v/ith  the  penalty  that  instances  of  that 
expression  must  be  formed  to  which  variable  splitting  is  applied. 
Tnese  instances  need  only  be  formed  in  those  cases  where  the 
variables  being  instantiated  could  be  matched  against  expressions 
similar  to  the  instances  formed.  The  second  requires  variable 
splitting  in  both  expressions  and  eliminates  the  formation  of 
instances.  The  choice  of  variable  splitting  rules  is  dependent 
on  the  type  of  expressions  which  are  most  likely  to  be 
encountered.  .If  fragment  variables  are  always  opposed  by 
sequences  of  variables  and  fragment  variaole  free  expressions, 
then  no  instantiation  is  needed  and  variable  splitting  need  only 
be  applied  to  one  of  the  expressions. 

Tne  variable  splitting  rule  is  called  "widening"  by  Slagle  and 
is  applied  to  both  expressions  and  is  equivalent  to  the  second 
form  above.  Variable  splitting  or  widening  corresponds  to 
paramodulation  by  functional  reflexive  axioms  in  conventional 
mechanical  theorem  proving.  Tne  application  of  variable 
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splitting  represents  parainodulation  by  the  functional  reflexive 
axiom  for  tne  function  which  is  the  head  of  the  expression  with 
the  variable  being  split.  The  sometimes  required  instantiation 
of  an  expression  by  a vector,  bag,  or  class  subexpression  which 
is  then  subjected  to  variable  splitting  (this  may  be  necessary  in 
the  case  where  variable  splitting  is  applied  to  only  one  of  the 
expressions,  not  to  both)  corresponds  to  par amod ul ation  by  the 
functional  reflexive  axiom  for  the  function  which  is  the  head  of 
the  expression  being  added. 

The  complete  unification  algorithms  for  bags  and  classes 
presented  here  are  new.  These  should  be  useful  primarily  in 
languages  and  applications  for  which  completeness  is  very 
important  and  tnere  is  no  advantage  to  having  incomplete 
semi-unification  around  which  merely  returns  the  most  likely  to 
be  useful  unifiers. 

In  summary,  it  is  our  view  that  artificial  intelligence 
languages  should  be  designed  in  a manner  which  readily  logical 
completeness.  Tne  use  of  incomplete  pattern  matching  facilities 
in  artificial  intelligence  languages  is  obviously  a major  cause 
of  incompleteness  in  these  languages.  Accordingly,  we  have 
suggested  a form  for  remedying  the  incompleteness  aspect  of 
pattern  matching  without  excessively  penalizing  the  language 
designer  or  user  with  the  costs  of  ensuring  completeness.  Tnis 
is  accomplished  by  separating  the  pattern  matching  (unification) 
process  into  two  components:  the  incomplete  semi-unification 
process  and  the  completeness  providing  variable  splitting 
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process.  We  stated,  and  proved  termination,  sounoncss,  and 
completeness  (with  variable  splitting)  for,  semi-unification 
algoritnms  for  the  vector,  bag,  and  class  data  types.  ,in 
addition,  we  separately  stated  and  proved  complete  unification 
algorithms  for  bags  and  classes. 
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4.  Summary  and  Conclusion 


This  thesis  takes  a rather  theoretically  oriented  look  at  what 
we  call  completeness  issues  in  artificial  intelligence  languages. 
Two  principal  areas  of  incompleteness  are  noted:  inference  system 
and  pattern  matching.  Each  of  these  areas  of  incompleteness  is 
treated  in  the  most  constructive  fashion:  we  provide  revised 
inference  systems  and  pattern  matching  facilities  which  overcome 
the  incompleteness.  Further,  this  i-j  done  with  a view  to 
minimizing  the  added  cost  of  running  a logically  complete  versus 
incomplete  system.  Also,  the  results  we  provide  are  often  of  as 
much  or  even  more  significance  to  the  field  of  theorem  proving  as 
they  are  to  the  design  of  artificial  intelligence  languages. 

We  feel  we  have  contributed  significantly  to  the  solution  of 
problems  of  logical  incompleteness  of  artificial  intelligence 
languages.  The  elimination  of  the  incompleteness  both 
facilitates  use  of  these  languages  in  their  present  applications 
and  encourages  their  use  in  new  domains,  notably  mathematical 
theorem  proving.  Our  work  in  developing  and  testing  special 
unification  algorithms  whose  nature  was  suggested  by  artificial 
intelligence  languages  represents  an  especially  significant  and 
beneficial  interaction  of  the  fields  of  artificial  intelligence 
languages  and  mechanical  theorm  proving.  We  believe  the  field  of 
mechanical  theorem  proving  can  continue  to  make  contributions  to 
the  design  of  artificial  intelligence  languages,  and  that  a major 
contribution  of  artificial  intelligence  languages  to  the  field  of 
mechanical  theorem  proving  (and  many  other  domains)  is  the 
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continuing  development  of  new  control  structures  used  in 
searching  for  proofs. 

Some  of  the  specific  accomplishments  of  the  thesis  are 
summarized  belo<' 


4.1.  An  inference  system  for  artificial  intelligence  languages 

In  the  area  of  inference  system  completeness,  we  present  the  w 
procedure  which  is  an  abstraction  of  the  MESON  procedure  which 
does  not  specify  a conceptual  execution  form  (AND/OR  tree  for 
MESON  procedure,  chains  for  linear  MESON  procedure).  The  * 
procedure  is  a direct  extension  of  the  problem  reduction  method 
which  results  in  a complete  theorem  proving  method  for  the  first 
order  predicate  calculus. 

In  contrast  to  the  MESON  procedures  in  which  expansion  and 
contradiction  (extension  and  reduction)  are  two  different 
processes  (where  expansion  is  the  same  as  the  problem  reduction 
operator  and  contradiction  is  the  necessary  augmentation  of  the 
problem  reduction  method),  the  » procedure  is  a uniform  single 
inference  operation  procedure  which  continually  applies  the 
problem  reduction  operation  in  the  context  of  the  negations  of 
all  higher  goals. 

One  way  of  interpreting  the  t procedure  extension  of  the 
problem  reduction  method  in  terms  of  artificial  intelligence 
languages  is  to  say  the  v procedure  results  from  the  replacement 
of  every  (GOAL  x)  statraent  by  a goal  statement  having  the  effect 
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of  the  statements  (ASSERT  -ix)  (GOAL  x)  or  (DENY  x)  (GOAL  x)  where 
the  assertion  or  denial  is  local  only  to  the  (GOAL  x)  attempt  and 
does  not  cause  consistency  directed  data  base  deletions.  This 
certainly  achieves  logical  completeness  at  minimal  cost  in 
organization,  syntax,  and  efficiency  of  artificial  intelligence 
languages. 

The  1 and  related  procedures  also  have  special  equality 
inference  rules  deducible  from  the  related  rules  of  the  model 
elimination  with  paramodulation  procedure.  Two  distinct 
approaches  to  eliminating  some  of  the  problems  of  excessive 
generation  of  clauses  in  the  equality  inference  rule  system  are 
discussed:  a marking  restriction  which  tries  to  prohibit 
alternate  derivations  of  the  same  terra  by  ordering  the 
applications  of  equality  inference  rules  and  the  extension  of  the 
concept  of  demodulation  to  the  * procedure. 

We  also  attempted  to  deal  with  the  problem  of  the 
incompatibility  of  the  use  of  models  with  the  v procedure.  So 
long  as  the  problem  reduction  method  was  used,  goals  could  be 
freely  discarded  without  loss  of  as  much  completeness  as  exists 
for  the  problem  reduction  method  if  the  goal  was  false  in  a model 
of  the  problem.  This  is  no  longer  true  when  the  more  powerful  v 
procedure  is  used.  Paradoxically,  it  can  even  be  shown  that  it 
is  sometimes  necessary  to  expand  goals  which  are  not  only  false 
in  the  model,  but  are  false  in  the  problem  statement  as  well. 

By  relaxing  restrictions  on  the  v procedure,  we  provide  a 
procedure  which,  although  not  yet  proved  complete,  is  capable  of 
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handling  all  known  counter-examples  to  the  completeness  of  models 
in  the  » procedure. 


4.2.  Programmable  Strategy  Theorem  Prover 

The  Programmable  Strategy  Theorem  Prover  (PSTP)  was  developed 
for  its  own  sake  and  as  a testbed  for  some  of  the  ideas  developed 
in  the  thesis. 

A prime  advantage  of  PSTP  is  the  flexibility  of  specification 
of  search  and  deletion  strategy.  Although  the  underlying 
inference  system  is  fixed  in  contrast  to  some  other  theorem 
proving  systems,  the  specification  of  search  and  deletion 
strategies  is  very  general,  relying  on  a general  mechanism  for 
attaching  property  values  to  chains  (computed  on  need  only)  and 
using  ordered  numerical  values  of  specified  properties  to  sort 
chain  lists  and  specify  order  of  chain  inference  operations. 

This  simple  provision  for  pure  heuristic  search  (35)  using 
ordering  functions  was  itself  a novelty  in  theorem  provers  at  the 
time  it  was  installed  in  PSTP.  This  facility  was  extended  to 
allow  partial  expansion  of  chains  using  the  SEARCH2  command. 

Another  novel  characteristic  of  PSTP  is  its  use  of  the  linear 
MESON  procedure  rather  than  a form  of  resolution  as  its  inference 
system.  Although  it  is  complete  in  contrast  to  natural  deduction 
systems,  the  linear  MESON,  MESON,  and  v procedures  (all  variants 
of  the  same  idea)  retain  the  form  of  the  problem  reduction 
method.  This  problem  reduction  related  form  should  make  proofs 
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more  comprehensible,  especially  when  compared  with  proofs  usinq 
non-linear  resolution  refinements.  The  linear  MESON  procedure 
also  strongly  supports  the  heuristic  search  mechanism  of  PSTP. 
Since  derived  chains  do  not  interact  with  each  other  (inference 
operations  are  performed  only  on  a chain  or  a chain  and  an 
axiom),  the  possible  inference  operations  that  can  be  performed 
on  a chain  are  fixed  (the  chain  can  be  extended  by  any  axiom  or 
be  reduced).  In  non-linear  resolution  refinements,  a derived 
clause  could  always  be  resolved  against  some  future  generated 
clause,  never  allowing  for  the  possibility  of  its  being  removed 
from  the  search  space  because  all  possible  inference  operations 
had  been  performed  on  it. 

We  also  analyzed  the  performance  of  PSTP  to  the  performance  of 
QA3.6  and  an  SL-resolution  theorem  prover.  From  this  analysis, 
we  deduced  that  (for  that  class  of  problems)  PSTP  with  the  linear 
MESON  procedure  performed  competitively,  that  search  strategies 
which  give  higher  weight  to  length  of  clause  relative  to  level  of 
clause  than  admissable  ones  often  outperform  admissable 
strategies  (at  the  risk  of  generating  longer  proofs) , and  that 
factorization  as  required  by  the  SL-resolution  procedure  is 
probably  a bad  idea  (we  hold  that  the  optionality  of 
factorization  in  the  model  elimination  procedure  and  its 
derivatives  is  a major  advantage) . 
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4.3.  Unification  algorithms  for  artificial  intelligence  languages 


In  the  area  of  pattern  matching,  incompleteness  arises  in  the 
failure  to  return  all  possible  unifiers  of  two  expressions. 
Depending  on  the  nature  of  the  expressions  (such  as  the  general 
pattern  language  of  PLANNER) , this  problem  may  be  an  unsol vable 
one  due  to,  for  example,  the  undecidability  of  the  emptiness  of 
intersection  of  two  context  free  languages  or  the  solution  of 
arbitrary  groups  with  generators.  Even  simpler  constructs 
(VECTORS  of  QA4/QLISP)  have  unification  problems  of  unknown 
solubility  (the  word  equation  for  monoids  in  this  case  of 
associativity  only). 

A general  method  of  solution  is  to  attempt  to  enumerate 
unifiers  rather  than  produce  them  all  at  once.  This  is  often 
more  practical  since  there  may  be  an  infinite  number  of  unifiers 
prohibiting  their  total  generation  and,  since  unifiers  can  often 
be  generated  in  an  order  approaching  simplest  first  order,  and 
simple  unifiers  are  more  likely  to  be  useful,  effort  can  be  saved 
by  generating  unifiers  piecemeal. 

The  approach  suggested  here  involves  splitting  the  unification 
process  up  into  two  separate  processes:  semi-unification  and 
elaboration.  The  semi-unification  process  returns  for  a pair  of 
expressions  (one  possibly  restricted  in  form)  a finite  number  of 
unifiers.  This  set  of  unifiers  need  not  be  a complete  set. 
Semi-unification  should  return  the  most  obvious  unifiers  of  the 
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two  expressions.  This  finite  unification  component  is  augmented 
by  the  potentially  infinite  elaboration  process  which  alters  one 
(the  possibly  restricted  form)  expression  causing  the 
semi-unification  algorithm  to  produce  new  unifiers  using  the  new 
expression.  Alternating  applications  of  elaboration  and 
semi-unification  would  enumerate  all  unifiers. 

Three  special  cases  of  great  interest  are  discussed  in 
substantial  detail:  the  0A4/QLISP  VECTOR  (associativity) r BAG 
(associativity  and  commutativi ty) r and  CLASS  (associativity, 
commutativity,  and  idempotence  (sets))  data  types.  Here, 
semi-unification  is  a unification  process  restricted  to  not 
splitting  up  the  values  of  variables.  The  elaboration  procedure 
(actually  a slight  variant  of  the  idea  of  elaboration)  consists 
of  splitting  fragment  variables  into  two  fragment  variables. 

In  addition,  we  provide  complete  unification  algorithms  for 
the  BAG  and  CLASS  cases.  The  complete  unification  for  BAGs  is 
also  extended  to  the  slightly  more  complex  case  of  unification  in 
the  first  order  predicate  calculus  with  associativity  and 
commutativity  (the  AC  unification  algorithm).  We  regard  the  AC 
unification  algorithm  and  the  related  BAG  unification  algorithm 
as  having  substantial  significance. 

Experimentation  with  a version  of  PSTP  augmented  by  the 
special  unification  procedures  for  associativity  and 
commutativity  confirms  the  view  that  substantial  gains  in 
performance  can  be  achieved  by  such  means. 

In  parallel  with  this  work  on  unification  algorithms  there  has 
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been  significant  work  done  by  Livesey  and  Siekmann  in  the  sa.ne 
area.  They  have  been  concerned  with  the  problem  of  associative 
(vector)  unification  (62)  and  have  also  developed  algorithms  for 
commutativity  (equivalent  to  bag  semi-unification  with  no 
fragment  variables) , idempotence,  and  associativity  plus 
commutativity  plus  idempotence  (classes)  [63].  Commutativity  and 
idempotence  occur  infrequently  without  associativity  in  the 
problems  with  which  we  are  concerned,  so  Livesey  and  Siekmann 's 
most  important  work  to  us  is  that  concerned  with  decidable  cases 
of  associative  (vector)  unification  and  the  associativity  plus 
commutativity  plus  idempotence  (class)  case.  While 
associative-commutative  or  bag  unification  depends  on  the 
solution  of  a homogeneous  linear  Diophantine  equation,  they  make 
the  perceptive  observation  that  class  unification  operates  in  the 
same  fashion  except  the  solution  of  the  equation  uses  Boolean 
algebra  addition  rather  than  ordinary  arithmetic  addition.  This 
results  in  a cleaner,  and  probably  more  efficient,  class 
unification  algorithm  than  the  one  presented  here. 

Significant  developments  have  also  occurred  in  the  area  of 
complete  sets  of  reductions  discussed  briefly  in  section  3.19. 
Lankford  and  Ballantyne  have  developed  complete  sets  of 
reductions  for  commutativity  without  associativity  and  a 
semi-decision  procedure  for  incompleteness  of  sets  of  reductions 
using  permutation  equations  [64,65].  Both  are  based  on  the 
concept  of  a hierarchical  approach  to  complete  sets  of  reductions 
including  reductions  and  an  equivalence  class  specified  by 
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commutative  or  permutatlve  axioms.  Huet  [611  also  developed  this 
approach  to  a decision  (not  just  semi-decision)  procedure  for 
reductions  with  an  equivalence  class  where  the  reductions  are 
restricted  to  not  having  duplicated  variables  in  their  left  hand 
sides  (left  free).  Stickel  and  Peterson  have  developed  and 
implemented  the  idea  of  using  complete  unification  algorithms 
directly  in  computing  complete  sets  of  reductions  rather  than 
using  an  equivalence  class  [49] . Lankford  and  Ballantyne  also 
developed  some  of  the  same  techniques  for  application  to 
associative-commutative  reductions  and  used 

associative-commutative  unification  for  testing  completeness  of 
the  set  of  reductions  though  not  for  generating  new  reductions 
[66]. 


Appendix  A,  PSTP  Performance  Study  Results 


A.l.  Examples 


1.  BURSTALL  Exa^mple 


Axioms: 

1.  has(pl,ass( j,nO)  ) 

2.  follows (p2, pi) 

3.  has (p2,ass (k,nl)  ) 

4.  labels (loop, p3) 

5.  follows (p3,p2) 

6.  has (p3 , if then (equal (j , n) ,p4) ) 

7.  has (p4, goto (out) ) 

8.  follows (p5,p4) 

9.  follows (p6 ,p3)  . 

10.  has (p6 , ass (k , times  (n2, k) ) ) 

11.  follows(p7,p6) 

12.  has(p7,ass(j,plus(j,nl) ) ) 

13.  follows (p8 ,p7) 

14.  has (p8 , goto (loop) ) 

15.  follows (xp,yp)  * succeeds (xp,yp) 

16.  succeeds (xp, zp)  succeeds (zp, yp)  ♦ succeeds (xp, yp) 

17.  has (xp, goto (zp) ) labels (zp, yp)  ♦ succeeds (yp, xp) 

18.  has (xp, ifthen (ze,yp) ) ♦ succeeds (yp, xp) 


Theorem: 

19.  succeeds (p3,p3) 

2.  SHORTBURST  Example 


Axioms: 

1.  labels{loop,p3) 

2.  has (p3, if then (equal (j , n) ,p4) ) 

3.  has (p4 , goto (out) ) 

4.  follows (p5,p4) 

5.  follows (p8,p3) 

6.  has (p8,goto(loop) ) 

7.  follows (xp,yp)  ♦ succeeds(xp,yp) 

8.  succeeds (xp, zp)  succeeds (zp,yp)  ♦ succeeds (xp, yp) 

9.  has (xp, goto (zp) ) labels (zp, yp)  ♦ succeeds (yp,xp) 

10.  has  (xp,  if  then  (ze,yp) ) -»  succeeds  (yp,xp) 


Theorem: 

11.  succeeds (p3,p3) 
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3.  HAS-PARTS  Example  1 


Axioms: 

1. 

2. 

3. 

4. 

5. 

6. 
7. 


in (John, boy) 
in{x,boy)  ♦ in(x, human) 

hp(x,xm,y)  ♦ in(skl(x,y,z,xm,xn) ,y)  hp (x, t (xm, xn) , z) 
hp(x,xm,y)  ♦ -»hp (ski  (x, y,  z, xm, xn)  ,xn , z)  hp (x,  t (xm, xn)  , z) 
in(x,hand)  ♦ hp(x,n5, fingers) 
in(x, human)  ♦ hp(x,n2,arm) 
in(x»arm)  -*  hp (x, nl ,hand) 


Theorem: 

8.  hp (John, t (n2, nl) , hand) 


4.  HAS-PARTS  Example  2 


Theorem: 

8.  hp(John,t(t (n2,nl) ,n5) , fingers) 

5.  PRIM  Example 

Axioms : 

1.  Oxx 

2.  Dxy  Dyz  ♦ Dxz 

3.  Px  Og(x)x 

4.  Px  Lnlg(x) 

5.  Px  Lg(x)x 

6.  Lnlx  Lxa  ♦ Pf(x) 

7.  Lnlx  Lxa  ♦ Df(x)x 

8.  Lnla 

Theorem: 

(9.  -iPx  -tOxa  negation  of  theorem) 
10.  Pxl  Dxla 

6.  ANCESl  Example 

Axioms: 

1.  -.J  A H 

2.  K H J 

3.  -^K  H J 

4.  -A  -.8 

5.  -A  B 

6.  -H  -C 

Theorem: 

7.  B -.C 


318 


7.  NUMl  Example 


Axioms: 

1.  Px  Myzw  Dxw  ♦ Dxy  Dxz 

2.  Mxxs(x) 

3.  Mxyz  ♦ Myxz 

4.  Mxyz  ■»  Dxz 

5.  Mas(c)s(b) 

6.  Pa 

Theorem: 

7.  Dab 

8.  GROUPl  Example 

Axioms: 

1.  Pxyu  Pyzv  Pxvw  ♦ Puzw 

2.  Pxyu  Pyzv  Puzw  ■*  Pxvw 

3.  Pg(xy)xy 

4.  Pxh(xy)y 

5.  Pxyf(xy) 

Theorem: 

(6.  -«Pj(x)xj(x)  negation  of  theorem) 
7.  Pj  (xl)  xl  j (xl) 

9.  GR0UP2  Example 

Axioms: 

1.  Pxex 

2.  Pexx 

3.  Pxyu  Pyzv  Puzw  * Pxvw 

4.  Pxyu  Pyzv  Pxvw  ♦ Puzw 
5a  Pxxe 

6.  Pabc 

Theorem: 

7.  Pbac 


319 


1 


A. 2.  Statistics 


Length 

Multi- 

plier 

Level 

Multi- 

plier 

Length 

Maximum 

Level 

Maximum 

Proof 

Code 

Chains 

Generated 

(ret/acc/tot/att) 

Search 

Time 

«Isecl_ 

..  BURSTALL  Example 

101 

100 

A 

191/191/215/3129 

46.3 

501 

100 

- 

- 

3 

74/  74/  75/1229 

16.1 

101 

100 

2 

12 

A 

45/  45/  45/741 

9.8 

501 

100 

2 

12 

A 

45/  45/  45/741 

9.2 

QA3.6 

best 

2 

12 

38/  42/  /1462 

QA3.6 

mean  16/19  2 

12 

99/118/  /3222 

SLRTP 

3 

13 

48/  48/  /122 

2.  SH0RT8URST  Example 


101  100 

- 

- 

A 

18/ 

18/ 

19/144 

3.0 

501  100 

- 

- 

A 

16/ 

16/ 

16/128 

2.1 

101  100 

2 

10 

A 

16/ 

16/ 

16/128 

2.2 

501  100 

2 

10 

A 

16/ 

16/ 

16/128 

2.1 

QA3.6  best 

2 

10 

12/ 

12/ 

/255 

QA3.6  mean  14/14 

2 

10 

20/ 

21/ 

/325 

SLRTP 

3 

10 

16/ 

16/ 

— i 4 2 

. HAS-PARTS  Example  1 

Implicative  form 

for 

axioms: 

101  100 

- 

- 

A 

7/ 

7/ 

7/  47 

0.7 

501  100 

- 

- 

A 

7/ 

7/ 

7/  47 

0.7 

101  100 

3 

10 

A 

7/ 

7/ 

7/  47 

1.0 

501  100 

3 

10 

A 

7/ 

7/ 

7/  47 

0.8 

Disjunctive  form 

for 

axioms: 

101  100 

- 

- 

A 

12/ 

12/ 

12/124 

1.8 

501  100 

- 

- 

A 

12/ 

12/ 

12/124 

1.7 

101  100 

3 

10 

A 

12/ 

12/ 

12/124 

1.5 

501  100 

3 

10 

A 

12/ 

12/ 

12/124 

1.5 

QA3.6  best 

2 

10 

8/ 

10/ 

/il2 

0A3.6  mean  6/6 

2 

10 

20/ 

24/ 

/343 

SLRTP 

2 

10 

12/ 

12/ 

/ 29 

I 


320 


Length 

Multi- 

plier 

Level  Length  Level 

Multi-  Maximum  Maximum 
plier 

. HAS- 

PARTS  Example 

2 

Implicative  form 

for 

axioms: 

101 

100 

- 

- 

501 

100 

- 

- 

101 

100 

3 

10 

501 

100 

3 

10 

Disjunctive  form 

for 

axioms: 

101 

100 

- 

- 

501 

100 

- 

- 

101 

100 

3 

10 

501 

100 

3 

10 

QA3.6 

best 

2 

10 

QA3.6 

mean  6/7 

2 

10 

SLRTP 

3 

13 

. PRIM 

Example 

Implicative  form 

for 

axioms : 

101 

100 

- 

- 

501 

100 

- 

- 

101 

100 

3 

18 

501 

100 

3 

18 

Disjunctive  form 

for 

axioms: 

101 

100 

- 

- 

501 

100 

- 

- 

101 

100 

3 

10 

501 

100 

3 

10 

101 

100 

3 

18 

501 

100 

3 

18 

QA3.6 

best 

3 

10 

QA3.6 

mean  9/10 

3 

10 

SLRTP 

3 

11 

. AHCCSl  Example 

101 

100 

— 

501 

100 

- 

- 

iOl 

100 

2 

10 

501 

100 

2 

10 

QA3.6 

best 

2 

10 

QA3.6 

mean  19/20 

2 

10 

SLRTP 

3 

10 

Proof 

Chains 

Search 

Code 

Generated 

Time 

i£®£/S££/£2£/Stt]. i§ecj_ 


A 

11/ 

11/ 

11/  83 

1.3 

A 

11/ 

11/ 

11/  83 

1.3 

A 

11/ 

11/ 

11/  83 

1.3 

A 

11/ 

11/ 

11/  83 

1.9 

A 

50/ 

50/ 

50/478 

8.1 

B 

38/ 

38/ 

38/430 

5.7 

A 

38/ 

38/ 

38/430 

5.7 

B 

38/ 

38/ 

38/430 

6.4 

12/ 

14/ 

/205 

44/ 

51/ 

/938 

20/ 

20/ 

/ 41 

A 

812/812/1052/8072 

200.7 

A 

57/  57/  64/640 

11.6 

A 

70/  70/  82/816 

12.3 

A 

54/  54/  60/609 

8.2 

B 

165/165/187/1883 

28.5 

C 

101/101/113/1220 

18.8 

B 

130/130/146/1532 

20.2 

B 

130/130/146/1532 

18.8 

B 

130/130/146/1532 

21.7 

C 

101/101/113/1220 

16.8 

13/  19/  /208 

36/  97/  /999 

122/134/  /243 

A 

23/ 

23/ 

23/240 

3.2 

A 

13/ 

13/ 

13/108 

1.3 

A 

13/ 

13/ 

13/108 

1.6 

A 

13/ 

13/ 

13/108 

1.4 

5/ 

12/ 

/158 

6/ 

13/ 

/129 

14/ 

14/ 

/ 26 

321 


Length 

Multi- 

plier 

Level  Length 
Multi-  Maximum 
plier 

Level 

Maximum 

Proof 

Code 

Chains 

Generated 

(ret/acc/tot/att) 

Search 

Time 

-I§S£l- 

. NUMl 

Example 

101 

100 

A 

10/ 

10/ 

11/ 

47 

1.2 

501 

100 

- 

- 

A 

10/ 

10/ 

11/ 

47 

1.1 

101 

100 

5 

10 

A 

10/ 

10/ 

11/ 

47 

0.9 

501 

100 

5 

10 

A 

10/ 

10/ 

11/ 

47 

0.9 

QA3.6 

best 

5 

10 

8/ 

10/ 

/ 

68 

QA3.6 

mean  11/11 

5 

10 

9/ 

11/ 

/ 

83 

SLRTP 

5 

10 

9/ 

9/ 

/ 

21 

. GROUPl  Example 

101 

100 

* 

— 

A 

14/ 

14/ 

14/ 

54 

2.0 

501 

100 

- 

- 

A 

14/ 

14/ 

14/ 

54 

1.5 

101 

100 

3 

10 

A 

14/ 

14/ 

14/ 

54 

1.5 

501 

100 

3 

10 

A 

14/ 

14/ 

14/ 

54 

1.4 

QA3.6 

best 

3 

10 

7/ 

7/ 

/ 

33 

QA3.6 

mean  9/9 

3 

10 

7/ 

7/ 

/ 

34 

SLRTP 

3 

10 

9/ 

12/ 

/ 

35 

. GROUP2  Example 

101 

100 

* 

A 

576/576/752/2408 

97.2 

501 

100 

- 

- 

B 

119/119/149/500 

17.0 

101 

100 

4 

10 

A 

225/225/325/938 

28.8 

501 

100 

4 

10 

A 

225/225/325/938 

36.7 

QA3.6 

best 

3 

10 

54/ 

74/ 

/324 

QA3.6 

mean  8/8 

3 

10 

60/ 

82/ 

/517 

SLRTP 

? 

? 

no  proof 

found 

BURSTALL  Example 


I 


^ Ji  JQ  ^ J3 


J3  i3  i3 


(U'O'O'O'O'O'D'O  *0*0  ^*0*0 

WCCCCCCC  cc  ccc 

ociQ^OfOOiaic;  oo;  o^o/oi 

<U.U4J4JAJ-U4JJ4  4J  U U 4J  u 
^XXXKXXX  XX  XXX 

4Ja»ai4>a)4>4»4)  a)a»  4»«a> 


lA  lA^lOlA^^r*  ^ 

s >,  ><  >s  >.>.  >e  >^  >.>.>,  >• 

} *0  'O'OTJ'O'n'O'O'O'D'D 

:c  cccccccccc 

f Of 

144  4J4J4>44444J4J44444i 

(X  xxxxxxxxxx 

) o ^^oo^o^oct^t 


CO  r*  (A  <T»  ^ 10 

>.>.>.>,>,>* 

Xi  X3  a JiI3  ^ 

6 

fli  “D  *0  *0  *0  ^ 

w c c c c c c 

O 4>  4>  01  V 4U  O 

0)  44  4J  44  44  A4 

X X X X X X X 

44  0)  01  O Of  O 0* 


oi  01 

5 ^ 

O Of 

^ Xi 

w fl) 

o ^ 


<44 

01  •— 

O 3 

<44 

<44 

A 

^•4 

3 01 

^ a 

P-<4  CO 

00 

0)  ^ 

o 

.«4 

pA  ^ 

CO  d 

a 

««4 

^ 0 

^ <0 

X X 

a * 

4 

^ VO 

-4  r* 

X <A 

VO  d 

4 4 

4fA 

fA 

«N  CM 

so  d 

r-  d 

^4  >4^ 

<A 

d ^ 

fA  /A 

fA  d 

d 

X X 

a « 

a « 

o o 

<A 

fA 

d 

4 pA 

d d 

d^ 

4 «» 

4 4H 

4 PA 

*J  4J 

a 

d 

4 

•A  X 

'4» 

01 

(A 

•A  X 

^ X 

o o 

4 

4 

X 

01  01 

01  *0 

•a 

X X 

X ' 

X 

C O' 

A 

X 

w 

*o  *0 

*0  Of 

Of 

'M' 

01 

01 

4 4 

X 

X 

tf)  *3 

Of  Of 

Of  Of 

Of 

0)  0) 

01  'C 

0)  rj 

OD  09 

0) 

*3  01 

Of  Of 

Of  Q 

u 

V 'V 

*0  Of 

V 0 

d d 

01 

01 

a 

O Of 

o o 

u u 

u 

Of  Of 

Of  Of 

Of  Of 

A 

o 

Of  O 

o o 

O 3 

3 

0)  Of 

Of  u 

Of  o 

CQ  0) 

o 

Of 

o o 

3 3 

3 ta 

01 

Cf  o 

o u 

o o 

iq  ro 

A 

X 

U 3 

01  01 

01  — 

u o 

O 3 

U 3 

JZ  X 

<a 

o 

3 0) 

3 3 

3 01 

3 01 

0 

A ^ 

<44 

01  — 

l«  — • 

• >•^4 

4^ 

0)  01 

01  »— 

0)  ^ 

l-^M4 

<44 

A 

«-»  P>4 

44-  44 

44  44 

^ a 

4-4 «-»  44  r-  r» 

r*  r* 

44  44 

44  p^ 

44  44  44  00  CD 

pw 

o 

A 

A A d d 

d a 

a 

44  44 

44  44 

A A CO  d d 

A O 

A lA 

X X d « « 

4 4 

4 

44  A A 

A A 

X X a • * 

44  A 

X A 

A d 

4 4 4 fA  A 

A A 

A 

Add 

d d 

4 4 4 A A 

A d 

44  44 

Q.  « 

A #A  A d Q* 

o d 

a 

a » * 

4 4 

A A A d d 

a 4 

o o 

4 

a a Q 

4^  >4^ 

4 A A 

A A 

o.  a 

• A 

■U 

A X 

— 4-,  4^  OT  U) 

01  0) 

01 

A X X 

X X 

44  44>44>  OT  0) 

A X 

O 0 

X — 

Cf)  01  0)  ns  *3 

3 3 

3 

X 44  44 

w W 

0)  0)  01  3 3 

X 

c o* 

01 

•3  *3  *3  Of  Of 

Of  Cf 

Of 

01  01 

01  0) 

3 3 3 Of  0) 

01 

01  *0 

Of  Cf  Of  Of  Of 

Of  Of 

Of 

01  03  *3 

3 3 

Of  Of  Of  Of  Of 

01  3 

09  00 

*0  Cf 

Of  O Of  o o 

O O 

u 

3 Of  Of 

Of  Of 

Of  Of  Of  3 O 

3 Cf 

a d 

Of  Of 

O O O CJ  u 

O (J 

u 

Of  Of  Of 

Of  0) 

0 0 3 0 0 

Of  Cf 

«4»  44 

Cf  o 

O o U 3 3 

3 3 

3 

Of  O U 

o o 

0 0 0 3 3 

Cl  O 

0)  01 

o u 

3 3 3 0)  0) 

01  01 

01 

U 3 O 

o o 

3 3 3 0]  01 

O O 

<9  <0 

O 3 

0)  01  0)  — 

<■«  4i-p 

44 

0 3 3 

3 3 

01  01  0)  — — 

O 3 

X X 

3 01 

3 01  01 

01  01 

3 01 

01  *— 

<--4M 

•■4 

01  *— *— 

<44  44 

01  ^ 

44  44 

SA  1/1  SO 

VO  VO 

vO 

44  — 44r-  r4p.-r*-r« 

44  44  44  09  09 

pM  a 

<0  d CL  d d d 

d a 

d 

^4  A A 

AAr*  Q.  dddd 

A A S9  d d 

X X 

d » 

4 4 

XXX 

XXd^o**^* 

X X d % « 

4 4 

4 A A A A A 

A A 

A 

4 4 4 

4 4 4AAAAA 

44  4 A A 

A A 

A Q d d d d 

a a 

a 

AAA 

AAA  0.  0.  d d d 

A A d d 

d Q. 

0,44  44  44  44  — 

^ 44 

d d d 

add  W44  4444 

d d d 

w «>* 

w Cl  Cl  01  01  01 

01  01 

0) 

^ ' 

44  44  44(/)R1(|)010) 

44  44  44  (0  Ul 

o o 

01  3 3 3 3 3 

3 3 

3 

01  0)  0) 

W fO  0»33333 

0)  01  0)  3 3 

3 3 

3 Cf  Cf  Of  Of  Of 

Of  Of 

Of 

3 3 3 

‘t;330f0fOfCf4f 

■o  T3  "D  »)  « 

Of  Of 

O Cf  Cf  Of  Of  Of 

Of  Cf 

Of 

Of  Of  Of 

■■:,'  Of  vj  Of  Of  Of  Of  Of 

Of  Of  Of  Of  Of 

Of  Cf 

Of  o o o o o 

o o 

o 

O Of  Of 

CfCfCfOOOOO 

Of  Of  Of  0 0 

o o 

o o o o o o 

o o 

o 

o o o 

oooooooo 

00000 

o o 

0 3 3 3 3 3 

3 3 

3 

o o o 

00033333 

0 0 0 3 3 

3 3 
01  (0 

3 01  0)  0}  0)  01 

0)  01 

0) 

3 3 3 

01  0)  01 

333010)0)0)9) 

4)  01  (0  ^ ^ 

3 3 3 01  01 
01  01  0) 

• m#n  fA 

>C.OjQ«Q»Q.Q.O.  Qf^dOt  Of 

^ 4 • 4 

4 ^ rA  ^ <A  ^ ^ ^ d ^ ^ ^ 

1 d d d d d d d d ^ d d d d 

-’(OOfOlllUllAOlUlXOIOl'^M 
l'a‘T3'!3T3'D‘U^'0'-"0'0  ^ •O 
jOiOfOlOfOfOOfOftflOfOf'^'Of 
;OOfOfOOfOOfO»3lOOlOOf 
4UUOUOtJOUOUO^U 
fOOOOUOCfU-^OOOO 
433333333^33X3 
inoioioioioiciviouioioo) 


»<A^  fA^fArA<AfA»A^ 

LdCt^  dCbOiO.  dddd 

4 4 4 rA  44444444 

I m rA  d<A  «A  /A  rA  rA  rA  rA 

‘‘S’wtN  S £ E* ii wiw  S 

IO)tOX(/)MirtO’/)V)(/>Vl 
)'OA3— "D'O'D'D'n'n'O'U 
lOfOfOlOfOOfOOOOfOf 
lOiCfS  efOfO*Of<ucicj« 
fOUOUUfJOUU'JO 
fOU<*<UUUOUU(.fO 
133«-«33333333 
lOlClOOUllAMinvlOlO) 


^O«A<N<A^4A\0r»‘ 

«M^r4C'>«<XA«Air4nr« 

O 

O 


TO  os  o • 
rx  Al  4A  » 


CQ  • • • • 

o ^ <N 

■44  pA  (N  <N  <N 

o 

o 


^ tA  »0  f**  A)  O * 
«Nnr4<N«Nr4<N#A» 


A A A A A 

A 

A 

d d d d d 

C 

0. 

4 

4 

A 

A 

d d d d d 

C 

0 

44  ^4  44  ^4  44 

A 

44 

Of 

0)  0)  Cl  19  01 

Ai 

A 

01 

3 3 3 3 3 

Of 

4 

Q. 

3 

Of  Of  Cf  0 Of 

A 

C 

C 

Of 

Of  Of  0 Of  Of 

d 

X 

fQ 

Of 

0 0 0 0 0 

fS 

0 

X 

0 

00000 

•9 

3 

U 

0 

3 3 3 3 3 

X 

3 

01  V)  01  0)  0) 

Ul 

a 

0) 

X 

(/} 

yi 

0: 

< 

• 

H 

< 

• 

s 

A 

A A ^ 1/1  vO  A 

oc 

CO 

ro 

<M 

A 

A A A A A A 

< 

tw 

0 

Ot 

0 

06 

0 

1 

0 

n 

u 

w> 

w 

X 

d 

< 

d 

(0 

X 

• 

• 

A 

A 

1 


10 

X) 

• 

m 

x% 

CQ 

n 

IQ 

X)  X) 

« 

X) 

IQ  X>  01  X3 

r>»  r>5  so  CM  rM 

m r*» 

4r 

r-  in  so 

CM  ^ 

r5 

m ^ 

M* 

r-  cn  so 

CM  .H 

r4  c*>  O'  oM  r->  m m 

>. 

>,>,>,>.>, 

>. 

>•  >» 

> 

>•>*>< 

>.  >• 

>. 

>.  >• 

> 

>•  >4 

X3  j3  A X3  X3 

X) 

X>  X) 

X) 

JZ  JZ  JZ 

X>  JO 

x> 

^ X) 

X) 

jo  x>  ^ 

X3  X> 

A 3 X2  ^ X3  X}  X3 

e 

8 

G 

•o 

*0  ^ T)  ^ ^ 

Of 

tJ 

T)  *0 

•0 

ry  'orz 

Tf  *0 

Of  •n 

•o  *3 

TJ 

^ *3  TJ 

*3  *3 

Of'3'3’0'3'3'0^ 

c 

c c c c e 

M 

c 

c c 

c 

C G G 

c c 

U c 

c c 

c 

c c e 

c c 

MCCCCCCC 

Of 

Of  Of  Of  Of  Of 

o 

Of 

Of  Cl 

Of 

Of  Of  Of 

Of  Of 

O Of 

Of  Of 

Of 

Of  Of  Of 

Of  Of 

OOfOfOfOfOfOfOf 

Ai 

AJ  Ai  M Af  ^ 

Of 

Ai 

AJ  M 

AJ 

AJ  AJ  AJ 

AJ  AJ 

Of  AJ 

Ai  AJ 

AJ 

AJ  AJ  AJ 

Ai  AJ 

OAJAiAJAJAiAfAJ 

X X X X X 

£ 

X 

X X 

X 

XXX 

X X 

X X 

X X 

X 

XXX 

X X 

Xxxxxxxx 

Of 

Of  Of  Of  Of  Of 

AJ 

Of 

Of  Of 

01 

Of  Of  Of 

Of  Of 

AJ  Of 

Of  Of 

Of 

Of  Of  Of 

Of  Of 

Ai  01  Of  Of  OP  Of  Of-Of 

*o  c 
c 

«0  JS 

£ - 


^ c 
c • 

- r*# 
CM  C 

e • 

-o 
TJ  c 
c <0 

^ • 


H « 
• C 

C JZ 

£ o 

o »n 

-4  ^ 
^ bl 
«l 

^ Or 

O .c 
-C 


CM  CM 
C C 

c c 

SI  £1 

0 O € 
1-5  W 

w W 

01  o.  « 

JS  ^ 


•o  *0  CM  *0  *0 

c c c c c c 

fQ  >Q  • (Q  IQ  iQ 

£ £ 'O  ^ JC  jC 

^ ^ ^ ^ ^ 
c c « c c;  c 

« •XM  « » « 

(M  CM  >C  CM  CM  CM 

c c • c c c 

«4  JC  4iC  Ai  ^ 

^ • O A » % 

c c n c c c 
£ £ ^ £ i:  jc 
o o o o o 

n 2,  'JJ  S 2, 2. 

Of  0«  0«  Qi  Ou 

C £ £ 


>. 

o 

n 

c 

£ 

o 

•~3 


^ c 

C C9 
CQ  F. 
fi  3 
3 X 
£ • 
^ c 
c j: 
£ o 
o r> 
— 
^ c 
c 


^ e 6 

S w u 

W 4 (Q 

a •»  « 

•»CM  CM 
CM  c c 
c « % 
•>  c c 

C JZ  JZ 

£ o o 
o n 
n 

^ a a 

CL£.  £. 
Ji  — ^ 


MA 

X 

O' 

A 

c 

MA 

^ c 

MA 

•m4 

MA  C 

•3 

G IQ 

m 

<M 

C IQ 

c 

IQ  S 

c 

A 

IQ  8 

8 3 

A 

m 

8 3 

JZ 

3 X 

MA 

c 

3 X 

X 

A 

X « 

• c 

e 

OA 

c 

m 

e JZ 

A 

m 

C X 

c 

JZ  o 

CM 

c 

X o 

% 

O *3 

C 

A 

O *3 

*3  ^ 

MA 

»3 

c 

Ai 

c 

c 

C -M 

A 

c 

C -M 

A 

•H 

(A 

A 

•*4 

CM 

W 

CM 

c 

Of 

C 

m«A« 

MA  ^A  ^A 

O' 

am 

MA  MA 

Ai 

6 8 6 

c 

AJ 

MA  E 6 

A 

M M M 

•M 

A 

MA  Emm 

0) 

«M 

IQ  IQ  iQ 

CM 

OT 

m4 

mH  M IQ  IQ 

u 

X 

X 

AAA 

A 

U 

X 

X IQ  A A 

Of 

A 

A 

CM  CM  CM 

Of 

A 

A aC>|  CM 

O' 

(M 

C^l  ^ 

C C C 

X 

O' 

CM 

CM  -aCM  c C 

c 

c 

C 6 

AAA 

A 

c 

e 

C 8 C « « 

•M 

A 

A M 

c c c 

c 

•M4 

A 

^ * c c 

CM 

e 

C 19  £ JZ  JC 

X 

CM 

c 

C IQ  C X X 

A 

£ 

JZ  - 

o o o 

o 

A 

X 

X « X O O 

o 

O -A 

*3  *3  »3 

*3 

3 

o 

O MA  o *3  *3 

X 

•3 

*3 

AM  AM  Am 

C 

•3 

*3  ^ *3  — — ' 

«>-■» 

c 

a a a 

IQ 

^ c a a 

c 

a 

0> 

XXX 

X 

a 

a • a £ 

£ 

£ 

X CM 

Am 

ca 

A 

X 

X <M  £ WAM 

Q 

o 

c 

c 

c 

2 



-A-O 

_ — — 

■M 

o 

MA  ^ A 

IQ 

*3 

*3  C *3  *3  3 

r MA 

*3  ^ 

3 

3 C 3 3 3 

IQ  O* 

AC  *3 

c 

C IQ 

c c c 

*3 

^3 

C 

C IQ  C C C 

a r 

^ 0)  c 

IQ 

IQ  X 

IQ  IQ  IQ 

MA  C <-4  C 

IQ 

IQ  X 0)  IQ  « 

r — 

.M  — IQ  ^ 

X « 

XXX 

m4  IQ  J£  CQ  JS 

X %X  X X 

X 

X c ^ 

A 

A«M 

AAA 

X X 

(Q  X 

A 

A ^ AAA 

MI  MA 

^ X 

MA  MA  MA 

A A 

Am  a 

MA 

MA  X MA  MA  MA 

IQ  IQ 

— ^ 

^ A 

rM  ^ ^4 

MA  MA 

aMA 

^ ^ ^ ^ 

IQ  MA  MA 

•wiM 

c 

c c 

c c c 

m4  m4  X ^ 

c 

c c c c c 

^ IQ  IQ 

c 

c ^ c 

A 

• X 

AAA 

c c 

r c 

A 

a£  AAA 

eft  w w 

^ A (D  « 

CM 

CM  o CM  CM  CM 

A A 

A 

CM 

CM  O CM  CM  CM 

O'  O' 

CM 

(ft  C4  M CM 

c 

C *3 

sec 

CM  CM 

maCM 

C 

C 3 C C C 

(r*j  U 

c 

M c Of  c 

AM 

A^  Ai»  AM 

c c 

C 

AM  AM  AM  AM  AM 

U 

^ w O"-* 

Ai 

Ai  m4 

Ai  Ai  AJ 

Am  am 

•3  — 

AJ 

M m4  4J  4i  ^ 

Ai 

rT*f  C Ai 

A 

JC 

AAA 

Ai  AJ 

C Af 

A 

A A A A 

MA  MA  MA 

C A.M  A 

c 

C iO 

c c c 

A A 

IQ  * 

c 

c ui  c c e 

m IQ  IQ 

c 

•M  C CM  C 

e 

X '-’ 

XXX 

c c X e X 

X ^X  X X 

AM  AM  AM 

£ IM  ^ •>  jC 

o 

o c 

o o o 

X X 

-X 

o 

o e o o o 

O'  CT  O' 

o - O *A  o n 

»->  cn  1^  C *-5 

^ c — • a 
o«  <•  o.^ 

£ — lO  £ — 

me 

c 

O M £ 
C W W «• 
• O 01  ^ 

CM  cr  C7*  c 
c c c ^ 


^ c ^ •• 


^ tn  ^ w • 
n u ^ w 


<v  e e; 
cr  * c*<‘ 
C CM  c 

C *-4  ' 


n •-*  »-5  ^ 

Q.m  q«  cl  O* 

sz  ^ ^ c sz 

c 

^jz 

M * M W 0) 

w ^ w u w 
Cl  c (U  01  Of 
O*  • CT"  ^ W 

c ^ c c c 


o o ^ o n 

w w c — a 
o.  a * n sz 
£ £ £ 


•3  -.^4  O 


I ^ CM  C CM  «M  CM 


^ U)  V)  CM  M 
to  M M C W 
M OC  O •—  O 
Cl  O'  CT 

tr  c c G 

C -M  'M  (/I  -M 
•M  «M  CM  M CM 


Q.^  a a a 

^ ^JZ  £ JZ 
S c 
<0 

tf)  £ n • V>  M M 
M « M W M M 
Oi  *4  Of  C Of  01  Of 
CT  C O'  • ^ lyi  ^ 

c ^ c ^ c c c 

•M  A .M  <M  *M  •M  *#4 
CM  ^ CM  c CM  Cm  CM 


CM 

A 4J 

• A 

A 

A 

C « • 

A 

A A 

CM 

A 

A ftf  A 

A 

c 

A 

A 

AAA 

A 

MA  Cf 

MA 

AMA  CM 

MA 

MA  MA 

A 

MA 

-A  O'  -A 

MA 

A 

MA 

CM 

MA  MA  MA 

— 

m m m w 

m 

m 

1-4  m c m m m 

MS 

m 

m c m m 

IM 

sn 

c m m m 

m 

c u 

C Of 

c 

c 

C C A 

c 

c e 

m 

C 

C .4  c 

c 

C 

c 

A 

c c c 

c 

A 9, 

• O' 

A 

A 

A A-O 

A 

A A 

G 

A 

A «M  A 

A 

A 

A 

3 

AAA 

A 

MA  O' 

MA  C 

MA 

MA 

3 c 

MA 

MA  MA 

• 

A 

MA 

MA  A MA 

MA 

3 

MA 

C 

MA  MA  MA 

m4  e m4  ••4 

m4 

M4 

C ^ IQ  ^ 

m4  m4 

MA 

m4 

M4  *3  m4 

m4 

C ^ 

IQ  ^ m4  m4 

m4 

C -4 

C IM 

c 

c 

OCX 

c 

c e 

m4 

c 

c c c 

c 

IQ 

C £ 

c c e 

e 

* CM 

• A 

A 

A 

X » » 

A 

A A 

c 

A 

A IQ  A 

A 

£ 

A 

A 

AAA 

CM  -4  CM  CM 

«CM  m4 

CM  CM  CM 

A 

CM 

CM  £ (M  CM 

A 

CM  M4 

CM  CM  CM 

CM 

c ^ 

C X 

c 

C ^ C X 

c 

c c 

CM 

C 

c « c 

c ^ 

c 

X 

c c e 

c 

X 

AM 

AM 

X — ’ A 

AM 

AM  AM 

C 

Am 

w C AM 

X 

AM 

A 

AM  AM  AM 

Ai  C 

Ai 

Ai 

« AJ  C 

Af 

AJ  Ai 

MA 

AM 

AJ 

Ai  £ A> 

Ai 

A 

4J 

c 

Ai  Ai  AJ 

MA 

c 

— 'X 

Am 

C — X 

AM 

AM  «i^ 

>• 

Ai 

AM 

w O 

AM 

c 

AM 

£ 

AM  AM  AM 

>• 

•bJ  £ 

Ai  O 

Ai 

4j  £ 4i  o 

AJ 

Ai  Ai 

o 

AM 

AJ 

Ai  ^ Ai 

Ai 

c 

Ai 

o 

AJ  Ai  Ai 

o 

4J 

• O 

• 3 

• 

• 

o *13 

X 

Ai 

A AM  » 

o 

• 

3 

AAA 

X 

<0 

Q 


<0 
CO  A* 

oi  r 
r ^ 


IQ  IQ 
IQ  IQ 


^ ^ CM 


C C C 

•J  ij  ^ 


IQ  CQ  CQ 
O*  O'  O* 

IQ  IQ  IQ 

O'  «r  O' 


Q Q Q 


O O 
IQ  O O 


IQ  IQ  IQ 


IQ  IQ 
CM  '—'w 
K O O 


X M X 

O Q Q 


C3  ^ii. 

IQ  IQ  IQ 
O"-'^  — ' 
^ C'  O O 

Q CM  CM  CM 

O Q O 


a 

F 

IQ 

M — ' 

cii  aj 

£ • 


• C>QC'«-CCi^C'^CCC  • 
c^— C 
£0^0-x00i-40jr000£ 

oi-5-xn  u)*^on  o 

Q.'— ' CL  (Tuo.  Q*— ’ 0>Q»Q.CLaL*— 

ax:  ® ^ 


% c c ^ c 
c £ £ JC  £ 
j:  o o ti  o 

O ^ 1-5 

^ ’ c — ' 
a a*M  r)» 
ax:  £ 


_ n c c c _ 

£— iC-^JCXTiQC 
0«-40JXOOOf 

f-j  ^ w^nn  o 

W M — 

Q.'—  a a o.  a a'-' 

£ CL£  -G  £ £ £ C 


IQ  ^ ^ ^ 


X X 

: a Q a 


Q IQ  IQ  IQ 


O — W^—  WWW 


• 

• • • • 

F- 

< • • 

• 

• • 

• 

• • • 

• 

CO 

o 

-4  CM  3 ^ 

flO 

CO  O' 

o 

m4  cm 

3 

4r  m vo 

r>» 

m4 

m4  m4  m4 

< 

CM 

3 

3 3 

• 3 3 3 

3 

CM 

o* 

O 

o 

1 

O 

O 

(/) 

u 

u 

a. 

a« 

00^0  ^<M  vm«o>  r- 


V 

•M 

a 

n 

IQ 

X 

bJ 

X 

06 

a* 


: X X X 

I CL  Q«  CL 


X CM  CM  CM 
flu  Cl  (L  A« 


< • 
o 

CM  M« 

O 

o 


^ CM  m « tn  \o  r- 


<0  jQ  <0  i3  <9  ja 


^ O <Q  (0 


jQ  <Q  fO  IQ  IQ  4 
r»>r'ic'%o»A^f>*im^ 


Ml  IQ  ^ « lO  O 
»o  «H  ^ irt  rM  m 


‘OT3T3'n'0^^^‘T3*0*0 

ccccccccccc 

4lSlOOOQlQI4i4lft>Q^ 

mju^*j*jjj***j*j**^ 

XXHitXXXHHXH 

a>OQi4>oi4)C)a>ci«v 


jonja^^ja^n^xi 

e B 

O'OTJ'C'D'O’a'O’O  O^'O'Q^'D'O^'O'O'O'O 

WCCCCCCCC  WCCCCCCCCCC 

0O^ClO4iQ><U4l 

Ci4J.UAJ^U*J.U.U*J4J 
XZXXXXXXXX  £XXXXXXXXXX 

4J<tlQlVQ>VOQ>Q)  ^QiOQ>QlO<U4l<tlOO 


JO  n ja  ja  ^ xi 


O tJ  T?  *0  'O  *0  "tj 

W c c c c c c 

O Q>  O Q>  Ql  Q> 

a jj  jj  ^j  ^ 4J 
JZ  X X X X X X 

4J  0)  Q;  41  0)  V 41 


7.  Oab  theorem 

8.  (Cab|  Pa  Mxlbx2  0ax2  -lOaxl  extend  by  la 

9.  (Oabj  Pa  Hbbxl  Daxl  reduce 

10.  (Dab]  Pa  Mbbxl  (Daxl|  Max2xl  extend  by  4 


326 


Appendix  B.  Associativity  and  Commutativity  Test  Results 


B.l.  Examples 

1.  Quinlan  & Hunt  Example  1 (Chang  Example  10a) 

Axioms: 

1.  x-»-y-.y+x 

2.  x+ (y+z)  ♦ (x+y) +z 

3.  (x+y)-yx 

4.  x-*(x+y)-y 

5.  (x-y) +z-‘ (x+z) -y 

6.  (x+y) -z* (x-z) +y 

7.  x-*y  y*z  ♦ x*z 

8.  x*x 

9.  x*y  u-*x+v  •*  u-»y+v 

10.  x*y  u-x-v  ♦ u»y-v 

11.  x-*y  u^v-x  ♦ u-»v-y 

Theorem: 

12.  (a+b) +c>a+ (b+c) 

2.  Quinlan  & Hunt  Example  2 

Theorem: 

12.  (a-b)+b-*a 

3.  Quinlan  & Hunt  Example  3 

Theorem: 

12.  a*(a-b)+b 

4.  Quinlan  & Hunt  Example  4 

Theorem: 

12.  a+ (b-c) * (a+b) -c 

5.  Quinlan  & Hunt  Example  5 (Chang  Example  10b) 

Theorem: 

12.  (a-b) +c*a+ (c-b) 

6.  Quinlan  & Hunt  Example  6 

Theorem: 

12.  (a-c) - (b-c) ♦a-b 

7.  Quinlan  & Hunt  Example  7 

Theorem: 

12.  (a+c) - (b+c) -a-b 
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8.  Quinlan  & Hunt  Example  8 (Chang  Example  10c) 

Theorem: 

12.  a+ (b-c) ♦ (a-c) +b 

9.  Quinlan  & Hunt  Example  9 (Chang  Example  lOd) 

Theorem: 

12.  (a+b) -c*a+ (b-c) 

10.  Quinlan  & Hunt  Example  10 
Theorem: 

12.  (a-b)-c-(a-c)-b 

11.  Quinlan  & Hunt  Example  11 
Theorem: 

12.  a-(b+c) ♦(a-b)-c 

12.  Quinlan  & Hunt  Example  Id 
Theorem: 

12.  (a+b) -c^a- (C“b) 
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B.2.  Statistics 


Search  Strategy  Employed/ 
Options  Used 


Proof  Chains  Search 

Code  Generated  Time 

1§®£L_ 


1.  Quinlan  & Hunt  Example  1 (Chang 

501*nblit  + 100*level 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
" - axioms  1&2  -t*  commu.  + assoc. 

nblit  + size  blits 

■ - axiom  1 + commutativity 
" - axiom  2 + associativity 

" - axioms  1&2  + commu.  + assoc. 

2.  Quinlan  & Hunt  Example  2 

501*nblit  + 100*level 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
" - axioms  1&2  + commu.  + assoc. 

nblit  + size  blits 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 

• - axioms  1&2  + commu.  + assoc. 

3.  Quinlan  & Hunt  Example  3 

501*nblit  100*level 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
” o axioms  1&2  -f  commu.  -»■  assoc. 

nblit  + size  blits 
" - axiom  1 + commutativity 

” - axiom  2 associativity 

" " axioms  1&2  commu.  + assoc. 

4.  Quinlan  & Hunt  Example  4 

501*nblit  + 100*level 

■ - axiom  1 + commutativity 

• - axiom  2 + associativity 

" > axioms  1&2  + commu.  assoc. 

nblit  size  blits 

■ - axiom  1 commutativity 

• - axiom  2 + associativity 

• • axioms  1&2  -f  commu.  assoc. 


Example  10a) 


A 

6 3/ 

63/ 

86/488 

12.8  + 

B 

1/ 

1/ 

1/ 

6 

0.7 

C 

1/ 

1/ 

1/ 

1 

0.2 

C 

1/ 

1/ 

1/ 

1 

0.2 

A 

13/ 

13/ 

18/ 

87 

4.7 

B 

1/ 

1/ 

1/ 

6 

0.6 

C 

1/ 

1/ 

1/ 

1 

0.2 

C 

1/ 

1/ 

1/ 

1 

0.2 
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3/ 

3/ 

4/ 

19 

1.3 

A 

3/ 

3/ 

4/ 

18 

1.1 

A 

3/ 

3/ 

4/ 

18 

0.8 

A 

3/ 

3/ 

4/ 

17 

1.3 

A 

3/ 

3/ 

4/ 

18 

1.1 

A 

3/ 

3/ 

4/ 

16 

1.2 

A 

3/ 

3/ 

4/ 

16 

0.8 

A 

3/ 

3/ 

4/ 

14 

1.0 

A 

4/ 

4/ 

5/ 

24 

1.0 

A 

5/ 

5/ 

6/ 

28 

1.4 

A 

4/ 

4/ 

5/ 

22 

1.2 

A 

5/ 

5/ 

6/ 

25 

1.5 

A 

3/ 

3/ 

4/ 

14 

0.7 

A 

3/ 

3/ 

4/ 

13 

0.7 

A 

3/ 

3/ 

4/ 

13 

1.0 

A 

3/ 

3/ 

4/ 

12 

0.8 

A 

280/280/378/2154 

140.2 

B 

1/ 

1/ 

1/  3 

0.3 

C 

82/ 

82/110/547 

54.0 

B 

1/ 

1/ 

1/  3 

0.3 

A 

155/155/224/1154 

83.8 

B 

1/ 

1/ 

1/  3 

0.3 

D 

23/ 

23/ 

32/152 

11.9 

B 

1/ 

1/ 

1/  3 

0.3 
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Search  Strategy  Employed/  Proof  Chains  Search 

Options  Used  Code  Generated  Time 

XSSSL- 


5.  Quinlan  & Hunt  Example  5 (Chang 

501*nblit  + 100*level 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
" - axioms  1&2  + coramu.  + assoc. 

nblit  + size  blits 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
" - axioms  1&2  + commu.  + assoc. 

6.  Quinlan  & Hunt  Example  6 

501*nblit  + 100*level 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
" - axioms  lii2  + commu.  + assoc. 

nblit  + size  blits 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
" - axioms  1&2  + commu.  + assoc. 

7.  Quinlan  & Hunt  Example  7 

501*nblit  + I00*level 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
" - axioms  1&2  + commu.  + assoc. 

nblit  size  blits 
" - axiom  1 + commutativity 
" - axiom  2 associativity 
” - axioms  1&2  commu.  assoc. 

8.  Quinlan  & Hunt  Example  8 (Chang 

501  100 

■ - axiom  1 + commutativity 
* - axiom  2 + associativity 
" - axioms  1&2  -f  commu.  + assoc. 

nblit  + size  blits 
" - axiom  1 + commutativity 
" - axiom  2 + associativity 
” - axioms  1&2  + commu.  + assoc. 


Example  10b) 


A 

49/  49/  64/373 

9.3 

B 

5/  5/  6/  27 

2.2 

A 

63/  63/  80/413 

33.9 

B 

5/  5/  6/  24 

2.0 

A 

12/  12/  17/  85 

5.7 

B 

3/  3/  4/  12 

0.9 

A 

12/  12/  17/  74 

5.5 

B 

3/  3/  4/  11 

>500 

0.9 

A 

277/277/359/1803 

>500 

190.7 

A 

260/260/334/1431 

>500 

>500 

>500 

>500 

>500 

165.3 

A 

21/  21/  26/137 

11.7 

B 

17/  17/  21/112 

9.5 

C 

17/  17/  22/  95 

9.5 

D 

95/  95/131/671 

53.2 

E 

45/  45/  61/292 

22.5 

F 

74/  74/106/452 

43.6 

C 

8/  8/  10/  42 

3.6 

Example  10c) 

A 

2615/2615/3481/20145 

615. 

B 

5/  5/  6/  27 

>500 

2.3 

B 

5/  5/  6/  24 

1.5 

C 

285/285/407/2100 

168.9 

B 

3/  3/  4/  12 

1.1 

D 

21/  21/  30/134 

7.5 

B 

3/  3/  4/  11 

0.9 
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9.  Quinlan  & Hunt  Example  9 (Chang  Example  lOd) 


501  100 

A 

48/ 

48/ 

63/364 

11.4  + 

Iff 

- axiom  1 + commutativity 

B 

1/ 

1/ 

1/  2 

0.3 

«• 

- axiom  2 + associativity 

A 

59/ 

59/ 

76/387 

32.7 

Iff 

- axioms  1&2  + commu.  + assoc. 

B 

1/ 

1/ 

1/  2 

0.3 

nblit  + size  blits 

A 

12/ 

12/ 

17/  83 

5.2 

Iff 

- axiom  1 + commutativity 

B 

1/ 

1/ 

1/  2 

0.5 

Iff 

- axiom  2 + associativity 

A 

12/ 

12/ 

17/  72 

5.0 

Iff 

- axioms  1&2  + commu.  + assoc. 

B 

1/ 

1/ 

1/  2 

0.4 

10. 

Quinlan  & Hunt  Example  10 

501*nblit  + 100*level 

A 

30/ 

30/ 

38/228 

14.2 

m 

- axiom  1 + commutativity 

A 

34/ 

34/ 

44/224 

17.2 

Iff 

- axiom  2 associativity 

A 

30/ 

30/ 

38/200 

15.6 

Iff 

- axioms  1&2  + commu.  + assoc. 

A 

3 4/ 

34/ 

44/192 

13.4 

nblit  + size  blits 

A 

13/ 

13/ 

16/  90 

5.2 

Iff 

- axiom  1 + commutativity 

A 

13/ 

13/ 

16/  79 

5.9 

Iff 

- axiom  2 + associativity 

A 

13/ 

13/ 

16/  79 

5.4 

Iff 

- axioms  1&2  + commu.  + assoc. 

A 

13/ 

13/ 

16/  68 

4.6 

11. 

Quinlan  & Hunt  Example  11 

501*nblit  + 100*level 

>500 

Iff 

- axiom  1 + commutativity 

A 

256/256/329/1673 

172.5 

a 

- axiom  2 associativity 

B 

130/130/172/866 

63.1 

a 

- axioms  1&2  + commu.  + assoc. 

C 

31/ 

31/ 

40/173 

28.1 

nblit  + size  blits 

D 

326/326/440/2335 

172.6 

a 

- axiom  1 + commutativity 

E 

18/ 

18/ 

22/118 

7.1 

a 

- axiom  2 + associativity 

F 

68/ 

68/ 

98/433 

38.8 

a 

- axioms  1&2  commu.  + assoc. 

C 

13/ 

13/ 

16/  70 

5.3 

12. 

Quinlan  & Hunt  Example  18 

501*nblit  + 100*level 

>500 

a 

- axiom  1 + commutativity 

A 

316/316/405/2064 

247.3 

a 

- axiom  2 + associativity 

>500 

a 

- axioms  1&2  + commu.  assoc. 

A 

292/292/372/1614 

169.4 

nblit  size  blits 

>500 

a 

- axiom  1 + commutativity 

>500 

a 

- axiom  2 -f  associativity 

B 

490/490/763/3092 

334.9 

a 

- axioms  1&2  + commu.  + assoc. 

C 

44/ 

44/ 

60/238 

23.6 
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